FIGURE 1 

GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGTCTGAAGAC T AACATT T T GTGAAGT TGTAAAACAGAAAACC TGTT AGAAATGTGGT GGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
TCATACAT T AC TGCAGT AACAC T C C ACCAT AT AGACCCGGCT TT ACC T TAT AT C AGTGACACT GG 
TACAGTAGCTCCAGAAAAATGCTTATTTGGGGCAATGCTAAATATTGCGGCAGTTTTATGCATTG 
C TACCAT T TATGT TCG TT AT A&GCAAG T T CAT GC TCT GAGT CC T GAAGAGAACGT TAT CAT CAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
TATATATGTTTGTTCAGACCATCCTTTCCTACCAAATGCAGCCCAAAATCCATGGCAAACAAGTC 
TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
ATCAGTTTT GCACAGTGGCAATT TTGGGACTGAT TTAGAACAGAAAC TCCATTGGAACCCCGAGG 
AC AAAGGT T ATGT GC T T CACAT GAT CACTAC T GC AGC AGAAT GG T C T AT GT CAT TTTCCTTCTTT 
GGTTTTTTCCTGACTTACATTCGTGATTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACA 
TGGAT TAACCCT C TAT GACAC T GCACCTT GC CC TAT T AACAATGAACGAACACGGCTACT TT CCA 
GAGATATTTGATGAAAGGATAAAATATTTCTGTAATGATTATGATTCTCAGGGATTGGGGAAAGG 
TTCACAGAAGTTGCTTATTCTTCTCTGAAATTTTCAACCACTTAATCAAGGCTGACAGTAACACT 
GATGAATGCTGATAATCAGGAAACATGAAAGAAGCCATTTGATAGATTATTCTAAAGGATATCAT 
CAAGAAGACTAT TAAAAACACCTATGCCTATACTTTT T TAT CTCAGAAAATAAAGTCAAAAGACT 
ATG 
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FIGURE 2 



<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8.39, NX(S/T): 0 

MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAV 

LC I AT I YVRYKQVHALS PEENVI IKLNKAGLVLG I L SCLGLS I VANFQKTT LFAAHVSGAVLT FG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSMSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGYVLHM I TTAAEWSMS FS FFGFFLT YI RDFQKI S LRVEANLHGLTLYDTAPCP I NNERTR 
LLSRDI 

Important features: 

Type II transmembrane domain: 

amino acids 13-33 

Other Transmembrane domains: 

amino acids 54-73, 94-113, 160-180, 122-141 

N-myristoylation sites. 

amino acids 57-63, 95-101, 99-105, 124-130, 183-189 
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FIGURE 3 



CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 

GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 

CTTACACTT CGCCATGAGT T TCCTCAT C GACT C CAGC AT CATGAT TACCTC CCAGATACT ATTT T TTG 

GATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 

GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 

AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 

TTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGA 

CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 

CATTCTCAGCCCAAAACATGGGATCTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGTGATTGGAG 

TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 

CTCAGGAATGTGACTGACACGGATATTCTAGCCCTGGAACGGCGACTGCTGCAAACCATGGATATGAT 

CATAAGCAAAAAGAAAAGGATGGCAATGGCACGGAGAACAATGTTCCAGAAGGGGGAAGTGCATAACA 

AACCATCAGGTTTCTGGGGAATGATAAAAAGTGTTACCACTTCAGCATCAGGAAGTGAAAATCTTACT 

CTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGCTTTTTCTGGAAACAGCTGATCT 

ATATGCTACCAAGGAGAGAATAGAATACTCCAAAACCTTCAAGGGGAAATATTTTAATTTTCTTGGTT 

ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 

GGGAAAACGGATCCTGTCACAAGAGGCATTGAGATCACTGTGAATTATCTGGGAATCCAATTTGATGT 

GAAGTTTTGGTCCCAACACATTTCCTTCATTCTTGTTGGAATAATCATCGTCACATCCATCAGAGGAT 

TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 

CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 

AGAATACCGCAC CATAATCACT GAAGT C CTT GGAGAACTGCAGTTCAACTT CTATCACCGT TGGTTTG 

ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 

GAGAAGCAAAT GGCACCT TGAACT T AAGCCT ACTACAGACT GT TAGAGGCCAGTGGTTTCAAAATTTA 

GATATAAGAGGGGGGAAAAAT GGAACCAGGGCCTGACATT T TATAAACAAACAAAAT GCTATGGTAGC 

ATTTTTCACCTTCATAGCATACTCCTTCCCCGTCAGGTGATACTATGACCATGAGTAGCATCAGCCAG 

AACATGAGAGGGAGAACTAACTCAAGACAATACTCAGCAGAGAGCATCCCGTGTGGATATGAGGCTGG 

TGTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGT 

CTAT GGTAGCTG AGCCAAACACGT AGGAT TT C CGTT T TAAGGT TCAC ATGGAAAAGGTTAT AGCTTT G 

CCTTGAGATTGACTCATTAAAATCAGAGACTGTAAC^ 

ACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 



FIGURE 4 



MSFLIDSSIMITSQILFFGFGWLFFMRQLFKDYEIRQYWQVIFSVTFAFSCTMFELIIFEILGV 
LNSSSRYFHWKMNLCVI 

FPILSPKHGILSIEQLISRVGVIGVTI^LLSGFGAWCPYTYMSYFLRNVTDTDILALERRLLQ 
TMDM 1 1 SKKKRMAMARRTMFQKGE VHNKPS G FWGM I KS VT T SAS GSENLT L I QQE VDALEEL S RQ 
LFLETADLYATKERIEYSKTFKGKYFNFLGYFFS I YCVWKI FMATINIVFDRVGKTDPVTRGIEI 
TVNYLGI QFDVKFWS QHI S FI LVGI 1 1 VT S I RGLL I T LTKFFYAI S S SKS SNVI VLLLAQ IMGMY 
FVS S VLL I RMSMPLE YRT 1 1 TE VLGE LQ FN FYHRW FDV I FLVS ALS S I L FL YLAHKQAPEKQMAP 

Important features: 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains : 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin-binding proteins 

amino acids 151-160 
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FIGURE g 

AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGTTCC 
AGAACTCTCCATCCGGACTAGTTATTGAGCATCTGCCTCTCATATCACCAGTGGCCATCTGAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACGATGGCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
G TCAT GCAGAAT TAT GGGGATCACCC TT GT GAGCAAAAAGGCGAACCAGCAGC T GAAT T TCACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGTTTGCAGCCTATTGTTACAACTCATCTGATACTTGGACTAACTCGTGCATTCCAGAA 
ATTATCACCACCAAAGAT CCCATAT TCAACACT CAAACTGCAACACAAACAACAGAATT TATTGT 
Z CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
U CTGCTCCAGCTTCCACTTCTATTCCACGGAGAAAAAAATTGATTTGTGTCACAGAAGTTTTTATG 
Q GAAACT AGCACCAT GT CTACAGAAAC T GAACCAT T T G T TGAAAAT AAAGCAGC AT TCAAGAAT GA 

iff! AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
f, s CAGCTGGTCTTGGATTTTGCTATGTCAAAAGGTATGTGAAGGCCTTCCCTTTTACAAACAAGAAT 
fl CAGCAGAAGGAAATGAT CGAAACCAAAGT AG TAAAGGAGGAGAAGGCCAAT GATAGCAACCCT AA 

^ T GAGGAAT CAAAGAAAACT GAT AAAAAC C CAGAAGAGT CCAAGAG T CCAAGCAAAAC T ACCGT GC 

m GATGCCTGGAAGCTGAAGT TTAGATGAGACAGAAATGAGGAGACACACCTGAGGC TGGTT TCT TT 

|R CATGCTCCTTACCCTGCCCCAGCTGGGGAAATGAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCA 

J" CCC T TGGT T CCTAAC T GGAAT CAGCT CAGGACT GCCAT T GGAC TAT GGAGT GCACCAAAGAGAAT 

GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTGCCACGGCCT 
**J TTCTAGCCTGGCTATGTCCTAATAATATCCCACTGGGAGAAAGGAGTTTTGCAAAGTGCAAGGAC 
If! CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
O AAAGCCAAGGAGTCACT GAGACCAAGGCT TT CTC TAC T GAT T CCGCAGC TCAGACCC T T T CT T CA 

m GC TC TGAAAGAGAAACACGTATCCCACCT GACAT GTCCT TC T GAGCC C GGT AAGAGCAAAAGAAT 

2 GGC AGAAAAGT T TAGCCCCT GAAAGCCAT GGAGAT TC T CAT AAC T T GAGACC TAATC TC T GT AAA 

!r! GC T AAAATAAAGAAATAGAACAAGGC T GAGGATACGACAG T ACAC T GT CAGCAGGGAC T GT AAAC 

fll ACAGACAGGGTCAAAGTGTTTTCTCTGAACACATTGAGTTGGAATCACTGTTTAGAACACACACA 
CTTACTTTTTCTGGTCTCTACCACTGCTGATATTTTCTCTAGGAAATATACTTTTACAAGTT^ACA 
AAAAT AAAAAC TCT T AT AAAT T T C TAT T T T T ATC TGAGT T AC AGAAAT GAT TAC T AAGGAAGAT T 
AC TCAG TAAT TT G T TTAAAAAGTAATAAAAT T CAACAAACAT T TGC TGAAT AGC TAC TAT AT GTC 
AAG T GC T GT GCAAGGTAT T ACACT CTGT AAT TGAATAT TAT T CCT CAAAAAAT TGCACATAGTAG 
AACGCTATCT GGGAAGCT AT T T T T T TCAGT T T TGATAT T TC TAGC T TAT C TAC T TC CAAACTAAT 
T T T TAT T T T T GC T GAGAC TAAT C T TAT T CAT T TT C TCTAAT AT GGCAACCAT TAT AACCT TAAT T 
TAT TAT T AACAT AC C T AAGAAG T ACAT T GT T ACC T C TAT AT AC C AAAG C AC AT T T T AAAAG T GC C 
ATTAACAAATGTATCACTAGCCCTCCTTTTTCCAACAAGAAGGGACTGAGAGATGCAGAAATATT 

TGTGACAAAAAATT AAAGCAT TTAGAAAACT T 
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FIGURE 6 



MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG 
L S LAGKDQVETALKAS FETCS YGWVGDG FWI SR I S PNPKC GKNG VGVL I WKVPVSRQFAAYCYN 
SSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKLICVTEVFMETSTMSTETEPFVENKAAFKNEAAGFGGVPTALLVLALLFFGAAAGLGFCYVK 
RYVKAFPFTNKNQQKEMIETKVVKEEKANDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV 

Signal sequence: 

amino acids 1-16 

Transmembrane domain: 

amino acids 235-254 

N-glycosylation site. 

amino acids 53-57, 130-134, 289-293 

Casein kinase II phosphorylation site. 

amino acids 145-149, 214-218 



III Tyrosine kinase phosphorylation site. 

amino acids 79-88 



N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 

GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 

CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 

CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 

GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 

GGAGGACACGCAGCACAAAT TGCGCAGCGCGGTGGAAGAGATGGAGGCAGAAGAAGCT GCTGCTA 

AAGCATCATCAGAAGTGAACCTGGCAAACTTACCTCCCAGCTATCACAATGAGACCAACACAGAC 

ACGAAGGTTGGAAATAATACCATCCATGTGCACCGAGAAATTCACAAGATAACCAACAACCAGAC 

T GGACAAAT GGT CT TT TCAGAGACAGT T AT CACATC T G T GGGAGACGAAGAAGGCAGAAGGAGCC 

ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC 

ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 

GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 

ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 

ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 

CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 

CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 

GAGATCCTGCTGCCCAGAGAGGTCCCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCG 

CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG 

CCGCCGCTGCACTGCTGGGAGGGGAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAA 

TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 

CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 

CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 

GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 

T TT GT T C TACATGGCT T T GATAAT T GT T T GAGGGGAGGAGAT GGAAAC AAT GTGGAGT C T CCC T C 

TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 

CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 

AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 

TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 

AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 

CAAGTCACACAGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCT 

CTCCACTACCCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 

TTTTCTTGAGGCATGCACATCTGGAATTAAGGTCAAACTAATTCTCACATCCCTCTAAAAGTAAA 

C TACT GT T AGGAACAG CAGT GT T C T CACAGT GTGGGGCAGC CGT C C T T CT AAT GAAGACAAT GAT 

AT TGACACT GTCCCTCTTT GGCAG T T GCAT TAG T AAC T T T GAAAGG TAT AT G AC T GAGC G TAGCA 

T ACAGGT TAACC T GCAGAAACAGTACT TAGGTAAT TGTAGGGCGAGGAT TAT AAATGAAATT TGC 

AAAATCACTTAGCAGCAACTGAAGACAATTATCAACCACGTGGAGAAAATCAAACCGAGCAGGGC 

T G T G T G AAAC AT G G T T G T AATAT GC GAC TGC G AAC AC T G AAC T C T AC G C C AC T C C AC AAAT GAT G 

TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 

CAT GAT T G T AT AAGC AT GCTTTCTTT GAG T T T TAAAT TAT G TAT AAAC AT AAG T T GCAT T T AGAA 

AT CAAG CAT AAAT CACT T CAACT GCAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 8 



MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDTQHKL 
RSAVEEME AEEAAAKAS SE VNLANLP PS YHNETNTDTKVGNNT I HVHRE I HKI TNNQTGQMVFSE 
TVI T S VGDEEGRRS HEC 1 1 DE DCGPSMYCQ FAS FQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC 
TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILLPREVPDEYEVGSFMEEVRQELEDLE 
RSLTEEMALGEPAAAAAALLGGEEI 



Casein kinase II phosphorylation site. 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 



Signal sequence: 




amino acids 1-19 



N-glycosylation site. 

amino acids 96-100, 106-110, 121-125, 204-208 



327-331 



N-myristoylation site. 

amino acids 202-208, 217-223 



Amidation site. 



amino acids 140-144 
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FIGURE Q 

CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCGTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAG TAT AT T GAC C ACAAGC T CAGAGGAGT GCAGAAC CC T GTAGCCCGCT GCAT CATG T G C T G T T T 
CAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAACCGCAATGCATACATCATGA 
TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 
ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 
GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 
GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 
GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 
AGATTCTGGGCAAGAAGAACGAGGCGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTCCGG 
CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 
CT.CCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 
TTGAGAGGCTGAGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 
AAACC TCCGTCTC TAT T AAAAATACAAAAAT TAGCC GAGAGT GGTGGCATGC ACC TGTC AT C CC A 
GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 
GAT CGCGCC AC TGC AC T C CAACC T GGGT GAC AGAC T C T GT C T CCAAAACAAAACAAACAAACAAA 
AAGAT T T TAT T AAAG AT AT T T T G T T AAC T C 



FIGURE IP 

RTRGRTRGGCEKVPINTSCNPTAHLVNSSCPGLMCVFQGYSSKGLIQRSVFNLQIYGVLGLFWTL 
NWVI^GQCVLAGAFASFYWAFHKPQDIPTFPLISAFIRTLRYHTGSLAFGALILTLVQIARVIL 
EYIDHKLRGVQNPVARCIMCCFKCCLWCLEKFIKFLNRNAYIMIAIYGKNFCVSAKNAFMLLMRN 
IVRVWLDKVTDLLLFFGKLLWGGVGVLSFFFFSGRIPGLGKDFKSPHLNYYWLPIMTSILGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 

Important features: 
I..... Transmembrane domains: 

A amino acids 57-80 (type II), 110-126, 215-231, 254-274 

W N-glycosylation sites. 

amino acids 16-20, 27-31, 289-293 

s Hypothetical YBR002c family proteins. 

Q amino acids 276-288 

if! 

if 8 * 

J: Ammonium transporters proteins. 

amino acids 204-231 

W 
?H 

N-myristoylation sites. 

amino acids 60-66, 78-84 

Amidation site. 

amino acids 306-310 
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FIGURE 11 

GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACGGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 
ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 
CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 
GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 
TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 
CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 
CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 
TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 
jf| GGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 
Ifl TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 
Q AACAGCCT GATGCAGACCGAGGAGT GCCCACC T AT GC TAGACGC CACACAGCAGCAGCAGCAGCAGGT 

fCl GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 

W ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 

fill 

'* * ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 
CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCTCA 
CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GACTTCGTGCCTTACTGAGTCTCTAAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 
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FIGURE 12 



MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLI FTFFLFLGVLVSIIMLSPGVESQL 

YKL PWVCEE GAG I PT VLQGHI DCGSLLGYRAVYRMC FATAAFFFFFFTLLMLCVS S S RDPRAAI Q 
NGFWFFKFLILVGLTVGAFYIPDGSFTNIWFYFGWGSFLFILIQLVLLIDFAHSWNQRWLGKAE 
ECDSRAWYAGLFFFTLLFYLLSIAAVALMFMYYTEPSGCHEGKVFISLNLTFCVCVS IAAVLPKV 
QDAQPNSGLLQASVITLYTMFVTWSALSS I PEQKCNPHLPTQLGNE TWAGPEGYETQWWDAPS I 
VGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY 
SFFHFCLVIASLHVMMTLTNWYKPGETRKMISTWTAWVKICASWAGLLLYLWTLVAPLLLRNRD 

FS 

Signal sequence : 

amino acids 1-20 

Transmembrane domains: 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 
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FIGURE 13 

CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GG T T GGAAAAAGACT CCT GT AACCC T CCT CCAGGATG?\AC CACC T GC CAGAAGACAT GGAGAACG 
C T CTCACCGGGAGCCAGAGCTCCCAT GCT T C TC T GCGCAAT AT C CAT T CCAT CAACCCCACACAA 
CTCATGGCCAGGATTGAGTCCTATGAAGGAAGGGAAAAGAAAGGCATATCTGATGTCAGGAGGAC 
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTA71ATG 
TGAATGGAGGCATTGAGAACACATTAGAGAAGGAGGTGAT GCAGTATGACTACTAT TCT TCATAT 
TTTGATATATTTCTTCTGGCAGTTTTTCGATTTAAAGTGTTAATACTTGCATATGCTGTGTGCAG 
K ACTGCGCCATTGGTGGGCAATAGCGTTGACAACGGCAGTGACCAGTGCCTTTTTACTAGCAAAAG 
Q TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
If! CTTGCCTGGATTGAGACGTGGTTCCTGGATTTCAAAGTGTTACCTCAAGAAGCAGAAGAAGAAAA 
W CAGAC T CCT GATAGTT CAGGATGC TT CAGAGAGGGCAGCACT TATAC C T GGT GGT C TT T CT GATG 

^■f G T CAGT T T T AT TCCCC T CC T GAAT CCGAAGCAGGATC T GAAGAAGC T GAAGAAAAA.CAGGACAGT 

£ GAGAAACCACTTT TAGAACTATGAGTACTACTT T TGT TAAATGTGAAAAACCC TCACAGAAAGTC 

ATCGAGGCAAAAAGAGGCAGGCAGTGGAGTCTCCCTGTCGACAGTAAAGTTGAAATGGTGACGTC 
0 CAC TGC TGGCTT TAT T GAACAGCTAATAAAGATT TAT T TAT T GTAATACCT CACAAAC GT T GT AC 

III CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 

.SPSS, 

J*( TCAGTGAGACTGAGCCTGATGTGTTAACAAATAGGTGAAGAAAGTCTTGTGCTGTATTCCTAATC 
pi AAAAG AC T T AAT AT AT T GAAG T AACAC T T T T T TAG T AAG CAAGAT AC C T T T T TAT T T C AAT T CAC 
t%\ AGAATGGAATTTTTTTGTTTCATGTCTCAGATTTATTTTGTATTTCTTTTTTAACACTCTACATT 
TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
. CTTGC AAAAGGGGAAGAAAGGAAT T GC GAATACAT GT AAAAT G T CACCAGACAT T T GT AT TAT T T 
TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
ACAAAATGACTTAAACCATTCATATCATGTTTCCTTTGCGTTCAGCCAATTTCAATTAAAATGAA 
CTAAATTAAAAA 
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FIGURE id 



MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDVRRTFCLFVTFDLLF 
VTLLW 1 1 ELNVNGG IENTLEKEVMQYDYYS S YFD I FLLAVFRFKVL I LAYAVCRLRHWWAI ALTT 
AVTSAFLIAKVILSKLFSQGAFGYVLPIISFILAWIETWFLDFKVLPQEAEEENRLLIVQDASER 
AAL I PGGLSDGQFYSPPE SEAGSEEAEEKQDSEKPLLEL 

Important features of the protein; 
Signal peptide: 

amino acids 1-20 

Transmembrane domains: 

amino acids 54-72, 100-118, 130-144, 146-166 
N-myristoylation sites. 

amino acids 14-20, 78-84, 79-85, 202-208, 217-223 
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FIGURE is 

ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 
H 1 CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 
p. GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 
pi TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 
£f CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 
9 l TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 
UJ TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 
\f TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 
I s I CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
m CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 
%F * TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 
« CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 
O ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 
[11 GGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCACGCCGAG 
Si GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 
TI AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 
^| GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 
0 GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 
m GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 
CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 
GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGC T GT GT GAC TC TAGT CT T GGCCCCAGGAAGCGAAGGAACAAAAGAAAC T GGAAAGGAAGAT GC 
T T TAGGAACAT GTTTTGCTTTTT TAAAAT AT ATAT AT T TATAAGAGATCC T T TC CCATT TATT C T 
GGGAAGAT GT T T T T CAAAC T C AGAGAC AAGGAC TTTGGTTTT T GT AAGAC AAACGAT GAT ATGAA 
GGC C T T T TGTAAGAAAAAATAAAAGAT GAAGTGTGAAA 
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FIGURE 16 



MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 

MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLRRLERLY 

LGKNRIRHIQPGAFDTLDRLLELKLQDNELRAXPPLRLPRLLLLDLSHNSLLALEPGILDTAlSrV'E 

ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 

AGLAALQELDVSNLSLQALPGDLSGLFPRLRLIAAARNPFNCVGPLSWFGPWVRESHVTLASPEE 

TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPWREPTALSSSLAPTWLSPTAPATEAP 

SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 

VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 

TVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPAVHSNHAPVTQAREGNLPLLIAPALAAV 

LLAALAAVGAAYCTORGRAMAAAAQDKGQVGPGAGPLELEGVKVPLEPGPKATEGGG 

CEVPLMGFPGPGLQS PLHAKPYI 

Important features: 
Signal peptide: 

amino acids 1-23 
Transmembrane domain: 

amino acids 579-599 

EGF-like domain cysteine pattern signature, 

amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 



GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCATGCG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
ATGAAGAAGGCAGCCAGGAT GAAT CC T TAGAT TC CAAGAC T AC T T T GACAT CAGATGAGT CAGTA 
AAGGACCATACTAGTGCAGGCAGAGTAGT TGCTGGTCAAATATT TCT T GAT TCAGAAGAATCTGA 
ATTAGAAT C CT C TAT T CAAGAAGAGGAAGACAGCC TCAAGAGCCAAGAGGGGGAAAGT GTCACAG 
AAGATATCAGCT T TCTAGAGTC TCCAAATCCAGAAAACAAGGAC TATGAAGAGCCAAAGAAAG TA 
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
T T TCC TAGAT AAGGAG TAT GATGAATG T ACATCAGAT GGGAGGGAAGAT GGCAGAC TGT GGT GTG 
C TACAAC CT AT GAC T ACAAAGCAGAT GAAAAGT GGGGCT T T T GTGAAACT GAAGAAGAG GCT GC T 
AAGAGACGGCAGATGCAGGAAGCAGAAATGATGTATCAAACTGGAATGAAAATCCTTAATGGAAG 
CAATAAGAAAAGCCAAAAAAGAGAAGCATATCGGTATCTCCAAAAGGCAGCAAGCATGAACCATA 
CCAAAGCCCTGGAGAGAGTGTCATATGCTCTTTTATTTGGTGATTACTTGCCACAGAATATCCAG 
GCAGCGAGAGAGATGTT TGAGAAGCTGACTGAGGAAGGCTC TCCCAAGGGACAGACTGCTCT T GG 
CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
T T GGAGC T C T TGGGGGCAAT C TAATAGCC CACAT GGT T TT GGTAAGT AG ACT T TAGT GG AAGGC T 
AATAATAT T AACATCAGAAGAAT TTGTGGTT TATAGCGGCCACAAC T T T T TCAGC T T TCAT GAT C 
CAGATTTGCTTGTATTAAGACCAAATATTCAGTTGAACTTCCTTCAAATTCTTGTTAATGGATAT 
AACACAT GGAAT C T AC AT GT AAAT GAAAGT T GGT GGAGT CC ACAAT T T T T CT T TAAAAT GAT TAG 
TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG 
GAATTGTCAGAATCATTTTTTACATTAGATTATCATAATTTTAAAAATTTTTCTTTAGTTTTTCA 
AAAT T T T G T AAAT GGT G GC T AT AG AAAAAC AAC AT G AAAT AT TAT ACAAT AT T T T G C AAC AAT G C 
CCTAAGAATT GT TAAAAT TCATGGAGTT AT TTGTGCAGAAT GAC TCCAGAGAGCTCTACTTT CTG 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGATTAACTCAT TTTT AAT AAAATTATGTCTAAGAT T AAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 18 



MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLDSEESEL 
ESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 
EYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRRQ^ 

EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
S S QAKALVYYT FGALGGNL I AHMVLVS RL 

Important features: 
L, Signal peptide: 

2ll amino acids 1-21 

if:??: 

%H 

yj N-glycosylation sites. 

% 4 amino acids 195-199, 217-221, 272-276 

iy 

Ws Tyrosine kinase phosphorylation site. 

L. amino acids 220-228 

m 

r! N-myristoylation sites. 

CO amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 

o 

Glycosaminoglycan attachment site. 

amino acids 267-271 



Microbodies C- terminal targeting signal. 

amino acids 299-303 

Type II fibronectin collagen -binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
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FIGURE 1Q 



AAT TCAGAT T T TAAGCCCAT TCTGCAGT GGAAT T T CAT GAACT AGCAAGAGGACACCAT C T TCT T 
GTATTATACAAGAAAGGAGTGTACCTATCACACACAGGGGGAAAAATGCTCTTTTGGGTGCTAGG 
CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 

ATAAGT ACAT T T T TAT CACT GGAT G T GAC T C GGGC T T T G GAAAC TTGGCAGCCAGAACT TT T GAT 

AAAAAGGGATTTCATGTAATCGCTGCCTGTCTGACTGAATCAGGATCAACAGCTTTAAAGGCAGA 

AACCTCAGAGAGACTTCGTACTGTGCTTCTGGATGTGACCGACCCAGAGAATGTCAAGAGGACTG 

CCCAGTGGGTGAAGAACCAAGTTGGGGAGAA?VGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 

CCCGGCGT GCTGGCTCCCACTGAC TGGC TGACACTAGAGGACTACAGAGAACCTAT TGAAGT GAA 

CCTGTTTGGACTCATCAGTGTGACACTAAATATGCTTCCTTTGGTCAAGAAAGCTCAAGGGAGAG 

TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 

TATGCAGT GGAAGGT TT CAATGACAGCT T AAGACGGGACAT GAAAGC T T T T GGTGTGCACGT CTC 

ATGCATTGAACCAGGATTGTTCAAAACAAACTTGGCAGATCCAGTAAAGGTAATTGAAAAAAAAC 

TCGCCAT T TGGGAGCAGCT GTC TCCAGACAT CAAACAACAAT AT GGAGAAGGTTACAT T GAAAAA 

AGTCTAGACAAACTGAAAGGCAATAAATCCTATGTGAACATGGACCTCTCTCCGGTGGTAGAGTG 

CAT GGACCACGC TCT AACAAGTC TC T T CC C TAAGACT CAT TATGCCGCT GGAAAAGATGCCAAAA 

TTTTCTGGATACCTCTGTCTCACATGCCAGCAGCTTTGCAAGACTTTTTATTGTTGAAACAGAAA 

GCAGAGCTGGCTAATCCCAAGGCAGTGTGACTCAGCTAACCACAAATGTCTCCTCCAGGCTATGA 

AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 

CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 

CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 

GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 

TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 

AT CAT C T C T TATC TAAAT AT TAAAAGAT AAG TCAACCCAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAA 



FIGURE 20 



MLFWVLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAARTFDKKGFHVIAACLTESG 
S TALKAET SERLRTVLLDVT DPENVKRTAQWVKNQVGEKGLWGL INNAGVPGVLAPTDWLTLE DY 
REP IEVNLFGL I SVTL^fMLPLVKKAQGRVINVSS VGGRLAI VGGGYTPSKYAVEGFNDSLRRDMK 
AFGVHVS C I E PGL FKTNLADP VKV I EKKLAI WE QL S PD I KQQYGEG Y I EKS LDKLKGNKS YVNMD 
LSPVVECMDHALTSLFPKTHYAAGKDAKIFWIPLSHMPAALQDFLLLKQKAEIANPKAV 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 136-152 

N-glycosylation sites. 

amino acids 161-163, 187-190 and 253-256 

GlYCosaminoglYcan attachment site. 

amino acids 39-42 

N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 



CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GGTGAAGCCAAGAACAGCATTACTGATTCCCAAATGGATGATGTTGAAGTTGTTTATACAATTGA 
CATTCAGAAATATAT T CCAT GC TATCAGC T T T T TAGC T T T T AT AAT TC TT CAGGCGAAGT AAATG 
AGCAAGCAC TGAAGAAAATAT TATCAAAT GT CAAAAAGAAT GT GGT AGGT T GGTACAAAT T C CGT 
CGTCATTCAGATCAGATCATGACGTTTAGAGAGAGGCTGCTTCACAAAAACTTGCAGGAGCATTT 
T T CAAACCAAGACC T T GT T T T T CT GC TAT T AAC ACCAAG T AT AAT AACAGAAAGCT GCT C TACT C 
|~! AT CG AC T G G AACAT T C C T T AT AT AAAC C T C AAAAAG G AC T T T T T C AC AG G G T AC C T T TAG T G G T T 

p, • GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
|| T T TTAGCCGAGCAGTACAAACACACAGCT CT AAAT T T T T T GAAGAAGAT GGATCC T TAAAGGAGG 

W TAC AT AAGAT AAAT GAAAT GT AT GC T T CAT T AC AAGAG GAAT T AAAGAG T AT AT GC AAAAAAGT G 

N GAAGACAGTGAACAAGCAGTAGATAAACTAGTAAAGGATGTAAACAGAT TAAAACGAGAAATT GA 

Sf GAAAAGGAGAGGAGCACAGAT T CAGGCAGCAAGAGAGAAGAAC ATCCAAAAAGACCCT CAGGAGA 

ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 
Q ATGTC TT TAAAAAATAGACATGT TTC T AAAAGTAGC T G TAAC T ACAAC CAC C ATC TCGAT GTAGT 

III AGACAAT C T GAC C T T AAT G G T AGAAC AC AC T GAC AT T C C T G AAG C TAG T C CAG C TAG TAC AC CAC 

'U AAAT CAT T AAGCATAAAGCCT T AGACT T AGATGACAGAT GG CAAT T C AAGAGATC T CGGT T GT TA 

2f GAT AC AC AAGACAAAC GAT C T AAAGC AAAT AC T G G TAG TAG TAAC C AAGAT AAAGC ATC CAAAAT 

|| GAGCAGCCCAGAAACAGATGAAGAAAT T GAAAAGATGAAGGG T T T T GG T GAATAT T CACGGT CT C 

CTACATTTTGATCCTTTTAACCTTACAAGGAGATTTTTTTATTTGGCTGATGGGTAAAGCCAAAC 
ATTTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCACC 
TGT TTGCAGTAATACACAGATAACTCTTAGTGCATT TACT TCACAAAGTACTT TTTCAAACATCA 
GATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCTTACACAG 
ACACAT T CT T T AGAAT T GGAAAAGT GAGACC AGGC ACAG T G GC T CACACC T GT AATCCCAGCACT 
TAGGGAAGACAAGTCAGGAGGAT TGAT T GAAGCT AGGAGT T AGAGACCAGCC T GGGCAACG TAT T 
GAGACCAT GT CTAT TAAAAAATAAAAT GGAAAAGCAAGAAT AGCCT TAT T T T CAAAATAT GGAAA 
GAAATT TATATGAAAATT TATCTGAGTCATTAAAAT TCT CCT TAAGTGATACTT TT TTAGAAGTA 
CAT TAT GGCTAGAGT T GCCAGAT AAAAT GC T GGAT AT C ATGCAATAAAT T T GC AAAACAT C AT CT 
AAAATTTAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 22 



h 
n 



Hi 



b 5 l 



MEGESTSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEWYTIDIQKYI 
PCYQLFSFYNSSGEWEQALKKILSNVKKNWGWYKFRRHSDQIMTFRERLLHKNLQEHFSNQDL 
VFLLLTPSIirESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSCMSTGFSRAV 
QTHS SK FFEE DGSLKE VHKI NEM YASLQE E LKS I CKKVE DSE QAVDKLVKDVNRLKRE I EKRRGA 
QIQAAREKNIQKDPQENIFLCQALRTFFPNSEFLHSCVMSLKNRHVSKSSCNYNHHLDWDNLTL 
3MVE HT D I PEAS PAS T PQ 1 1 KHKALDLDDRWQFKRS RLLD T QDKRS KANT G S SNQDKAS KMS S PET 
DEEIEKMKGFGEYSRSPTF 

Important features : 
Signal peptide: 

amino acids 1-19 



fji N-glycosylation sites. 

amino acids 75-79, 322-326 



N-myristoylation site. 

amino acids 184-154 



I'll Growth factor and cytokines receptors family. 

amino acids 134-150 
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FIGURE 23 



GGCACAGCCGCGCGGCGGAGGGCAGAGTCAGCCGAGCCGAGTCCAGCCGGACGAGCGGACCAGCGCAGGGCAGCCCAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
fc* AACAAGAACGATGCCCCCCTCCACGAGATCAACGGTGATCATTTGAAGATCTGTCCCCAGGGTTCTACCTGCTGCTCT 
P CAAGAGATGGAGGAGAAGTACAGCCTGCAAAGTAAAGATGATTTCAAAAGTGTGGTCAGCGAACAGTGCAATCATTTG 
P CAAGCTGTCTTTGCTTCACGTTACAAGAAGTTTGATGAATTCTTCAAAGAACTACTTGAAAATGCAGAGAAATCCCTG 
0] AATGATATGTTTGTGAAGACATATGGCCATTTATACATGCAAAATTCT GAGCTATTTAAAGAT CT CTTCGTAGAGTTG 

ly AAACGTTACT ACGT GGTGGGAAAT GT GAACCTGGAAGAAAT GCTAAAT GACTT CT GGGCT C G C CT C CT GGAGCGGATG 

TTC CGC CT GGT GAACT CCCAGTACCACT TTACAGAT GAGT ATCT GGAAT GT GTGAGCAAGT AT ACGGAG CAGCT GAAG 
Ml CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 
SF I TTAGCGGTTGCGGGAGATGTCGTGAGCAAGGTCTCCGTGGTAAACCCCACAGCCCAGTGTACCCATGCCCTGTTGAAG 
I _ AT GAT CTACT GCT CCCACT GC CGGGGTCT CGT GACT GT GAAGC CAT GTTAC AACT ACT GCT CAAACAT CAT GAGAGGC 

U T GTT T G GCCAACCAAGGGGAT CT CGATT TTGAAT GGAACAATT T CATAGAT GC TAT GCT GAT GGTGGCAGAGAGGCTA 

10 GAGGGTCCTTTCAACATTGAATCGGTCATGGATCCCATCGATGTGAAGATTTCTGATGCTATTATGAACATGCAGGAT 
W AATAGTGTTCAAGTGTCTCAGAAGGTTTTCCAGGGATGTGGACCCCCCAAGCCCCTCCCAGCTGGACGAATTTCTCGT 
PI TCCATCTCTGAAAGTGCCTTCAGTGCTCGCTTCAGACCACATCACCCCGAGGAACGCCCAACCACAGCAGCTGGCACT 
U AGTTTGGACCGACTGGTTACTGATGTCAAGGAGAAACTGAAACAGGCCAAGAAATTCTGGTCCTCCCTTCCGAGCAAC 
§y GTTT GCAAC GAT GAGAGGATGGCT GCAGGAAACGGCAAT GAG GAT GACT GTT GGAAT GGGAAAGGCAAAAGCAG GTAC 

CT GTTT GCAGT GACAGGAAAT GGATTAGC CAACCAGGGCAACAACCCAGAG GT CCAGGT TGACACCAGCAAACCAGAC 
ATACTGATCCTTCGTCAAATCATGGCTCTTCGAGTGATGACCAGCAAGATGAAGAATGCATACAATGGGAACGACGTG 

GACTTCT TT GAT AT CAGT GAT GAAAGT AGT GGAGAAGGAAGT GGAAGT G GCT GT GAGTATCAGCAGT GC CCT T CAGAG 
TTTGACTACAATGCCACTGACCATGCTGGGAAGAGTGCCAATGAGAAAGCCGACAGTGCTGGTGTCCGTCCTGGGGCA 
CAGGCCTACCTCCTCACTGTCTTCTGCATCTTGTTCCTGGTTATGCAGAGAGAGTGGAGATAATTCTCAAACTCTGAG 
AAAAAGTGTTCATCAAAAAGTTAAAAGGCACCAGTTATCACTTTTCTACCATCCTAGTGACTTTGCTTTTTAAATGAA 
TGGACAACAATGTACAGTTTTTACTATGTGGCCACTGGTTTAAGAAGTGCTGACTTTGTTTTCTCATTCAGTTTTGGG 

AGGAAAAGGGACTGT GCAT T GAGTT GGTT CCT GCT CCCCCAAACCAT GTTAAAC GTGGCT AACAGT GTAGGTACAGAA 
CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTCATTTCGTTTGTGGGTT 
TTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAAGCAAACCAGGGTCCCTTCTTGGCACGTAACATGTACGTATT 
TC TGAAATATTAAATAGCT GT ACAGAAGCAGGTT T T ATTT AT CAT GT TAT CT TATT AAAAGAAAAAGCC C AAAAAGC 
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FIGURE 24 



MARFGLPALLCTLAVLSAALL^ 

CCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH 
LYMQNSE L FKDL FVE LKRYYWG1WNLEEMLNDFWARLLERMFRLVNS QYH FTDEYLECVSKYTE 
QLKP FGDVPRKLKLQVTRAFVAART FAQGLAVAGDWSKVS WNP TAQCTHALLKMI YCSHCRGL 
VTVTCPC YNYCSNIMRGCLANQGDLD FEWNNFI DAMLMVAERLEGPFN I E SVMDP IDVKI S DAIMN 
MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 
EKLKQAKKFWSSLPSNVCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGIANQGNNPEVQVDTS 
KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG 
KSANEKADS AGVRPGAQAYLLTVFC I L FLVMQREWR 



II; I; 



Important features: 
Signal peptide: 

amino acids 1-22 

ATP/GTP-binding site motif A (P-loop) 



|fl amino acids 515-524 

w 

Jjjf N-glycosylation site. 



amino acids 514-518 

Glycosaminoglycan attachment sites . 

amino acids 494-498, 498-502 



N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 25 



CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
CTGAGTCATCCCCAGGGATCAGGAGGCTCCAGCAGGGAACCTTCCATTATATTCTTCAAGCAACT 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGC TT CT AGGAGAT GG CT CCAGGAAGGCG GCCAAGAAT GTGAGTGCAAAGAT T GGT T C CT GAG 
AGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGCAGTGCCCCTGTGATCATT 
TCAAGGGCAATGTGAAGAAAACAAGACACCAAAGGCACCACAGAAAGCCAAACAAGCATTCCAGA 
GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTGTAGGAGCTCTG 
AGCGC CCAC T C T T CCAAT T AAACAT TC TCAGCCAAGAAGACAGTGAGCACACCTACCAGACAC TC 
TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
AGC TAG T G T CAT T T AAC C T T AAAT GCAAT CAGGAAAG T AGCAAACAGAAG T CAAT AAAT AT T T T T 
AAATGTCAAAAAAAAAAAAAAAAAA 
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FIGURE 26 

MKVLISSLLLLLPLMLMSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 

Important features: 
Signal peptide: 

amino acids 1-22 

N-myristoylation sites. 

amino acids 27-33, 4 6-52 
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FIGURE 27 

GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
T GGCC GT GCT GACT CC CAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAGCTCTTCAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 

AG T TCCT GACAAAGAGCAT GGC T GAGAACAGGCACCT CCAATAT GAGC GG T T TG T GGTGGCT CC T 
GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 
CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC 
TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 
GAGCCCACCTGGAAACACATTGGGGATGGCTGCTGCCTCACCAGAGAGACCTGGAAGGATCTTGA 
GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 
GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 
TTCCCCAGCCTCCAATTAGAACAAGCCACCCACCAGCCTATCTATCTTCCACTGAGAGGGACCTA 
GCAGAATGAGAGAAGACATTCATGTACCACCTACTAGTCCCTCTCTCCCCAACCTCTGCCAGGGC 
AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 
AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 
CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 
CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
ATGCCAGAGCAAGAC TCAAAGAGGCAGAGGT TT TGTT CT CAAATATT TT T TAATAAATAGACGAA 
ACCACG 
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FIGURE 28 



MDILVPLLQLLVLLLTLPLHLMALLGCWQPLCKSYFPYLMAVLTPKSNRKMESKKRELFSQIKGL 
TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFWAPGEDM 
RQIADGSMDVWCTLVLCSVQSPRKVLQEVRRVLRPGGVLFFWEHVAEPYGSWAFMWQQVFEPTW 
KHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFPSSKALICSFPSL 
QLEQATHQPIYLPLRGT 

Important features : 
Signal peptide: 

amino acids 1-23 

Leucine zipper pattern. 

amino acids 10-32 

N-rnyristoylation sites. 

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 2Q 

CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCTATGCTGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGCTGGGAGATAGGGAACAGAAGAGGGTAGTGGGTGGGCTAGGGGGGCTGCCTTATTTAAA 
GTGGTTGTT TAT GAT T C T T ATAC T AAT T T AT AC AAAG AT AT T AAGGC C C T G T T C AT T AAG AAAT T 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGTTAAAAGCTGAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 30 

MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 



Important features: 
Signal peptide: 

amino acids 1-15 

M Growth factor and cytokines receptors family: 

amino acids 3-18 

o 
m 
w 

H 

w 
m 

o 

If! 
0 
CO 

o 
ru 
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FIGURE 31 

G T TT GAAT T CC T TCAACT ATACC CACAGT CCAAAAGCAGACT CAC TG T GT CCCAGGC TACCAG T T 
CCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
ATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAATCATGTCGG 
GAAGAGATACAATCC T TGGCCT GT GT AT CC T CGCAT T AGCC T T GT C T T TGGCCATGAT GT T TAC C 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TGTCTGCGGTGTTTTATGGTGGCTGTATTATGACTATACCAACGACCTCAGCATAGAATTGGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
CTGCTCGTCTTGATTT TT GT TC TCAGAAAGAGAATAAAATTGACAGTTGAGCTTTTCCAAAT CAC 
AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
AAT TGGCCT CATC TGGACTAGTGAAT T CATCCTTGCGT GCCAGCAAAT GACTATAGCTGGGGCAG 
T GGT TACT T G TTAT T T CAACAGAAGTAAAAATGATCC T CC T GAT CATCCCAT CCT TTCGTCTCTC 
TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
GAT TCC GAGAATCAT T GTCAT GT ACAT GCAAAAC GCACTGAAAGAACAGCAG CATGGTGCAT T GT 
CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
AACCAGAAT GCAT ATACT ACAAC T GC T AT T AAT GGGACAGAT T TC T GT ACAT C AGC AAAAGAT GC 
AT TCAAAAT CTTGTCCAAGAACTCAAGTCACTTTACATCTAT TAACTGCTT TGGAGACTTCATAA 
T T T T TCTAGGAAAGGT GT T AGTGGT GT GTT T CAC T GT T T T T GGAGGACT CAT GGCT T T TAAC T AC 
AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
TGGAAACAAATGATGGATCGTCAGAAAAGCCCTACTTTATGGATCAAGAATTTCTGAGTTTCGTA 
AAAAGGAGCAACAAAT TAAACAAT GCAAGGGCACAGC AGGACAAGCACT CAT TAAGGAATGAGGA 
GGGAACAGAACTCCAGGCCATTGTGAGATAGATACCCAT TTAGGTATCT GTACCTGGAAAACAT T 
T C C T T C T AAG AGC C AT T T AC AGAAT AG AAGAT GAG AC CAC T AGAG AAAAG T TAG T GAAT t T T T T T 
T TAAAAGACCTAATAAACCC TAT TC T T CC T CAAAA 
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FIGURE 32 

MSGRDT I LGLC I IALALSLAMMFT FRF I T T LLVH I FI S LVI LGLL FVCGVLWWL YYDYTNDLS I E 
LDTERENMKCVLGFAI VS TGI TAVLLVLI FVLRKRIKLTVELFQI TNKAI S SAPFLLFQPLWT FA 
ILIFFWVLWVAVLLSLGTAGAAQVMEGGQVEYKPLSGIRYMWSYHLIGLIWTSEFILACQQMTIA 
GAWTCYFNRSKNDPPDHPILS S LS I LFFYHQGTWKGS FL I SWRI PRI IVMYMQNALKEQQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FI I FLGKVLWCFTVFGGLMAFNYNRAFQWAVPLLLVAFFAYLVAHS FLSVFETVLDALFLCFA 
VDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR 

Important features: 
Signal peptide: 

amino acids 1-20 

Putative transmembrane domains : 

amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 

N-glycosylation sites. 

amino acids 204-208, 295-299, 313-317 

N-myristoylation sites. 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 
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FIGURE 33 

GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 
AT AAT T T GT ATGGGAT T T GTGATGCAGGAAAGCC TAAGGGAAAAAGAAT AT TCAT TC T G TG T GG T 
GAAAATTTTTTGAAAAAAAAATTGCCTTCTTCAAACAAGGGTGTCATTCTGATATTTATGAGGAC 
TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
ATT C AAACAAAGAAAC G G C AAAGAAG AT T AAAAG GC C C AAG T T C AC T G T G C C T C AGAT C AAC T G C 
GATGTCAAAGCCGGAAAGATCATCGATCCTGAGTTCATTGTGAAATGTCCAGCAGGATGCCAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTTGTTCGGAAGGTTGCTGGACAGTCT 
GGTTACAAAGGGAGTTATTCCAACGGTGTCCAATCGTTATCCCTACCACGATGGAGAGAATCCTT 
TATC GT C T TAGAAAGTAAAC CCAAAAAGGG T GTAAC C TACCCATCAGC T CT T ACAT ACT CAT CAT 
CGAAAAGTCCAGCTGCCCAAGCAGGTGAGACCACAAAAGCCTATCAGAGGCCACCTATTCCAGGG 
H" ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 

0 CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 
P GCCACAGGAGCCAGGAGAT GGAT C TC T GGT CCAC TGCCACC T ACACAAGCAGCCAAAACAGGCCC 
m AGAGCTGATCCAGGTATCCAAAGGCAAGATCCTTCAGGAGCTGCCTTCCAGAAACCTGTTGGAGC 
f 1 GGATGTCAGCCTGGGACTTGTTCCAAAAGAAGAATTGAGCACACAGTCTTTGGAGCCAGTATCCC 

T GGGAGAT C CAAACT GCAAAAT TGAC T TGT CGT T T T T AAT T GATGGGAGCACCAGCAT T GG CAAA 
SI CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 
111 CGGTCCACTGATGGGTGTTGTCCAGTATGGAGACAACCCTGCTACTCACTTTAACCTCAAGACAC 

1 p ACACGAATT CTCGAGATCTGAAGACAGCCATAGAGAAAAT TACTCAGAGAGGAGGACT TT CTAAT 

GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 
GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 
O GACTTGCGAGAGAGTCAGGAATCAACATTTTCTTCATCACCATTGAAGGTGCTGCTGAAAATGAG 
||1 AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 
f*\ GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 

2 ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 
5f- GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
Q AGAGTTTGAGATTTCCGACACGGACACGCGCATCGGGGCCGTGCAGTACACCTACGAACAGCGGC 
ft! TGGAGTTTGGGTTCGACAAGTACAGCAGCAAGCCTGACATCCTCAACGCCATCAAGAGGGTGGGC 

TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 

GTCCAAGCCCAACAAGAGGAAGTTAATGATCCTCATCACCGACGGGAGGTCCTACGACGACGTCC 

GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 

GCCCAAGAGGAGCTAGAAGTCATTGCCACTCACCCCGCCAGAGACCACTCCTTCTTTGTGGACGA 

GTTTGACAACCTCCATCAGTATGTCCCCAGGATCATCCAGAACATTTGTACAGAGTTCAACTCAC 

AGCC T C GGAACTGAAT TCAGAGCAGGCAGAGCAC CAGCAAG T GC T GC T T TAC TAAC TGACGT GT T 

GGACCACCCCACCGCT TAAT GGGGCACGCACGGTGCAT CAAGTCT TGGGCAGGGCATGGAGAAAC 

AAATGTCTTGTTATTATTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGA 

TGATCACAAACGTATAGAATGAGCCAAAAGGCTACATCATGTTGAGGGTGCTGGAGATTTTACAT 

T T T GACAAT T GTT T TCAAAATAAAT GT T CGGAAT ACAGT GCAGCC C T TAC GACAGGC T T ACGT AG 

AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 

TT T TGT CAT GACAATGT AGGAATTGCT GAAT TAAATGTT TAGAAGGAT GAAAAATAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 3d 



MRTWLTMKASVIEMFLVLLVTGWSNKETAKKIKRPKFTVPQIKCDVKAGKIIDPEFIVKCPAG 
CQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSLSLPRWR 

E S FI VLE SKPKKGVT YPSALTYSS S KSPAAQAGETTKAYQRPP I PGT TAQPVTLMQLLAVTVAVA 
TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP 

VGADVS LGLVPKEEL S TQSLEPVS LGDPNCKI DL S FL I DGS T S I GKRRFRI QKQLLADVAQALD I 
GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 

RS GAPNVVVVMVDGWPT DKVEEASRLARE SG IN I FF I T IEGAAENEKQYWE PN FANKAVCRTNG 
FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 
LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 
FKKS KPNKRKLM I L I TDGRSYDDVRI PAMAAHLKGVIT YAI GVAWAAQEELEVI ATHPARDHS FF 
VDEFDNLHQYVPRIIQNICTEFNSQPRN 

Important features; 
Signal peptide: 
amino acids 1-26 

Transmembrane domain: 

amino acids 181-200 

N-glycosylation sites. 

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Amidation site. 

amino acids 304-308 



APPJD-l 0063735 



Page 188 



FIGURE 35 



CCGAGC AC AGGAGAT TGC CT GCGT TT AGGAGGT GGC TGCGTT GT GGGAAAAGCT AT C AAG GAAG AAAT TGC 
CAAACCATGTCTTTTTTTCTGTTTTCAGAGTAGTTCACAACAGATCTGAGTGTTTTAATTAAGCATGGAAT 
AC AGAAAAC AAC AAAAAACTT AAGCTTT AATT T CAT CT GGAATT CCAC AGTT T T CT T AGCT C CC T GGAC CC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GGATGTCACTGAGATCCCTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTAC 
CTCAGCCTTCCCCACTACAATGTGATAGAACGCGTGAACTGGATGTACTTCTATGAGTATGAGCCGATTTA 

CAGACAAGACTT TC AC TT CAC ACT TCGAGAGCATTC AAACTGCT CTC ATCAAAATC CAT T TCT GGT CATT C 
J! TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT 
TGGT GGGGAT AT GAGGTTCTTACATTTTTCTTATT AGGCCAAGAGGCT GAAAAGGAAGAC AAAATGTTGGC 
ff| ATTGTCCTTAGAGGATGAACACCTTCTTTATGGTGACATAATCCGACAAGATTTTTTAGACACATATAATA 
M ACCT GACCT T GAAAAC CATT AT GGC ATT CAGGT GGGT AACTG AGT TTT GC C CCAAT GCC AAGT ACGT AAT G 

SI AAGACAGACACTGATGTTTTCATCAATACTGGCAATTTAGTGAAGTATCTTTTAAACCTAAACCACTCAGA 
Llj GAAGTTTTTCACAGGTTATCCTCTAATTGATAATTATTCCTATAGAGGATTTTACCAAAAAACCCATATTT 
ifl CTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAATGTCCAGAGAT 
* TTGGTGCCAAGGATCTATGAAATGATGGGTCACGTAAAACCCATCAAGTTTGAAGATGTTTATGTCGGGAT 
^ CTGTTTGAATTTATTAAAAGTGAACATTCATATTCCAGAAGACACAAATCTTTTCTTTCTATATAGAATCC 
Z; ATTTGGATGTCTGTCAACTGAGACGTGTGATTGCAGCCCATGGCTTTTCTTCCAAGGAGATCATCACTTTT 
J£| TGGC AGGT C ATGCT AAGGAACAC C ACAT GCCATT AT TAACTT C AC ATT CT AC AAAAAGCCT AG AAGGAC AG ■ 

K GATACCTTGTGGAAAGTGTTAAATAAAGTAGGTACTGTGGAAAATTCATGGGGAGGTCAGTGTGCTGGCTT 
ffj ACACTGAACTGAAACTCATGAAAAACCCAGACTGGAGACTGGAGGGTTACACTTGTGATTTATTAGTCAGG 
CCCT TC AAAG AT GATATGTGGAGGAAT T AAAT AT AAAGGAAT T GGAGGTTT T TGCT AAAGAAATT AAT AGG 
ACCAAACAATTTGGACATGTCATTCTGTAGACTAGAATTTCTTAAAAGGGTGTTACTGAGTTATAAGCTCA 
CTAGGCTGTAAAAACAAAACAATGTAGAGTTTTATTTATTGAACAATGTAGTCACTTGAAGGTTTTGTGTA 
TATCTTATGTGGATTACCAATTTAAAAATATATGTAGTTCTGTGTCAAAAAACTTCTTCACTGAAGTTATA 
CTGAACAAAATTTTACCTGTTTTTGGTCATTTATAAAGTACTTCAAGATGTTGCAGTATTTCACAGTTATT 
AT T ATT T AAAAT T ACT TC AACT TT GT GT T TTT AAAT GT TT T GACGATT T CAAT AC AAGAT AAAAAGGAT AG 
TGAATCATTCTTTACATGCAAACATTTTCCAGTTACTTAACTGATCAGTTTATTATTGATACATCACTCCA 
TTAATGTAAAGTCATAGGTCATTATTGCATATCAGTAATCTCTTGGACTTTGTTAAATATTTTACTGTGGT 
AAT AT AGAGAAGAAT T AAAG CAAGAAAAT CTGAAAA 
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FIGURE 36 



MASALWTVLPSRMSLRSLKWSLLLLS^ 

TLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDKMLA 
LSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWVTEFCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KPIKFEDVYVGICLNLLKYNIHIPEDTNLFFLYRIHLDVCQLRRVIAAHGFSSKEIITFWQVMLR 

NTTCHY 

Important features: 

Type II transmembrane domain: 

amino acids 20-39 

N-glycosylation sites. 

amino acids 72-76, 154-158, 198-202, 212-216, 326-330 

Glycosaminoglycan attachment site. 

amino acids 239-243 

Ly-6 / u-PAR domain proteins. 

amino acids 23-37 

N-myristoylation site. 

amino acids 271-277 
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FIGURE 37 



CGCTCGGGCACCAGCCGCGGCAAGGATGGAGCTGGGTTGCTGGACGCAGTTGGGGCTCACTTTTCTTCAGCTCCTTCTCATC 
TCGTCCTTGCCAAGAGAGTACACAGTCATTAATGAAGCCTGCCCTGGAGCAGAGTGGAATATCATGTGTCGGGAGTGCTGTG 
AATATGATCAGATTGAGTGCGTCTGCCCCGGAAAGAGGGAAGTCGTGGGTTATACCATCCCTTGCTGCAGGAATGAGGAGAA 
TGAGTGTGACTCCTGCCTGATCCACCCAGGTTGTACCATCTTTGAAAACTGCAAGAGCTGCCGAAATGGCTCATGGGGGGGT 
ACCTTGGATGACTTCTATGTGAAGGGGTTCTACTGTGCAGAGTGCCGAGCAGGCTGGTACGGAGGAGACTGCATGCGATGTG 
GCC AGGT T CTGC GAGCCCC AAAGGGTC AGAT TT TGT TGGAAAGC TATCCCC T AAAT GCTCAC TG TGAAT GGACCAT TC AT GC 
TAAACCTGGGTTTGTCATCCAACTAAGATTTGTCATGTTGAGTCTGGAGTTTGACTACATGTGCCAGTATGACTATGTTGAG 

G TTCGTGATGGAGACAACC GCGATGGCCAGATCATCAAGCGTGTC TGTGGCAACGAGCGGC CAGC T CC TATCCAGAGC AT AG 
GATCCTCACTCCACGTCCTCTTCCACTCCGATGGCTCCAAGAATTTTGACGGTTTCCATGCCATTTATGAGGAGATCACAGC 
ATGCTCCTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTGCCTGCTTGGCAGGC 
TATACTGGGCAGCGCTGTGAAAATCTCCTTGAAGAAAGAAACTGCTCAGACCCTGGGGGCCCAGTCAATGGGTACCAGAAAA 
TAACAGGGGGCCCTGGGCTTATCAACGGACGCCATGCTAAAATTGGCACCGTGGTGTCTTTCTTTTGTAACAACTCCTATGT 
TCTTAGTGGCAATGAGAA^GAACTTGCCAGCAGAATGGAGAGTGGTCAGGGAAACAGCCCATCTGCATAAAA.GCCTGCCGA 
GAACCAAAGATTTCAGACCTGGTGAGAAGGAGAGTTCTTCCGATGCAGGTTCAGTCAAGGGAGACACCATTACACCAGCTAT 
ACTC AGCGGCC T TCAGCAAGCAGAAAC TGCAGAGTGCCCCTACCAAGAAGCCAGCCC T TCC C T T TGGAGATCT GC C C ATGGG 
ATACCAACATCTGCATACCCAGCTCCAGTATGAGTGCATCTCACCCTTCTACCGCCGCCTGGGCAGCAGCAGGAGGACATGT 
CTGAGGACTGGGAAGTGGAGTGGGCGGGCACCATCCTGCATCCCTATCTGCGGGAAAATTGAGAACATCACTGCTCCAAAGA 
CCCAAGGGT TGCGCT GGCCGTGGC AGGCAGC CATC TAC AGGAGGACC AGCGGGGTGCATGACGGC AGCC TACACAAGGGAGC 
GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 
GTCACCATGAT C AAGAC AGCAGACC TGAAAGT TGTT T TGGGGAAAT TC TACCGGGATGATGACCGGGAT GAGAAGAC CAT CC 
AGAGCCTACAGATTTCTGCTATCATTCTGCATCCCAACTATGACCCCATCCTGCTTGATGCTGACATCGCCATCCTGAAGCT 
CCTAGACAAGGCCCGTATCAGCACCCGAGTCCAGCCCATCTGCCTCGCTGCCAGTCGGGATCTCAGCACTTCCTTCCAGGAG 
TCCCACATCACTGTGGCTGGCTGGAATGTCCTGGCAGACGTGAGGAGCCCTGGCTTCAAGAACGACACACTGCGCTCTGGGG 
TGGTCAGTGTGGTGGACTCGCTGCTGTGTGAGGAGCAGCATGAGGACCATGGCATCCCAGTGAGTGTCACTGATAACATGTT 
CTGTGCCAGCTGGGAACCCACTGCCCCTTCTGATATCTGCACTGCAGAGACAGGAGGCATCGCGGCTGTGTCCTTCCCGGGA 
CGAGCATCTCCTGAGCCACGCTGGCATCTGATGGGACTGGTCAGCTGGAGCTATGATAAAACATGCAGCCACAGGCTCTCCA 
CTGCCTTCACCAAGGTGCTGCCTTTTAAAGACTGGATTGAAAGAAATATGAAAT^ACCATGCTCATGCACTCCTTGAGAAG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 
■ GTGCCAGGGCTTCTGACTTCAGGGACAAAACTCAGTGAAGGGTGAGTAGACCTCCATTGCTGGTAGGCTGATGCCGCGTCCA 
C TAC TAGGACAGCCAAT TGGAAGAT GCC AGGGC TT GCAAGAAGTAAGT T T C T T CAAAGAAGACCATATACAAAAC C T C TCC A 
CTCCACTGACCTGGTGGTCTTCCCCAACTTTCAGTTATACGAATGCCATCAGCTTGACCAGGGAAGATCTGGGCTTCATGAG 
GCCCCTTTTGAGGCTCTCAAGTTCTAGAGAGCTGCCTGTGGGACAGCCCAGGGCAGCAGAGCTGGGATGTGGTGCATGCCTT 
TGTGTACATGGCCACAGTACAGTCTGGTCCTTTTCCTTCCCCATCTCTTGTACACATTTTAATAAAATAAGGGTTGGCTTCT 

GAACTACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAARAAAAAAAAAAAAAAA^ 
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FIGURE 38 

MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWG'YT 
I PCCRNEENECDSCL I HPGCT I FENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQILLESYPLNAHCEWTIHAKPGFVIQLRFVMLSLEFDYMCQYDYVEVRDGDNRDGQII 
KRVCGNERPAPIQSIGSSLHVLFHSDGSKNFDGFHAIYEEITACSSSPCFHDGTCVLDKAGSYKC 
ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTWSFFCNNSYVLSGNE 
KRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLPMQVQSRETPLHQLYSAA.FSKQKLQSAPTK 
KPALP FGDLPMG YQHLHTQLQYEC I S P FYRRLGS SRRT CLRTGKWSGRAPS C I P I CGKI EN I TAP 
KT QGLRWPW QAAI YRRT SGVHDG S LHKGAW FL VC S GALVNERT VWAAHC VT DLGKVTMI KTADL 
KWLGKFYRDDDRDE KTIQSLQI S AI I LHPNYDP I LLDAD I AI LKLLDKARI STRVQPI CLAAS R 
DLSTS FQE SH I TVAGWNVLADVRS PGFKNDTLRSGWSWDSLLCEEQHEDHGI PVSVTDNMFCA 
SWEPTAPSDICTAETGGIAAVSFPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 

ERNMK 

Important features of the protein: 
Signal peptide: 

amino acids 1-23 

EGF-like domain cysteine pattern signature. 

amino acids 260-272 
N-glycosylation sites. 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 

474-480, 491-497, 638-644, 666-672 

Amidation site. 

amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
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FIGURE 3Q 



GGTTCCTACATCCT CTCAT CTGAGAATCAGAGAGCATAAT CTTCTTACGGGCCCGT GATTTATTAACGTGGCTTAATC 

TGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGCATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGG 

TTTGGGCCCTTGTAGCTGACAGAAGGTGGCCAGGGAGAATGCAGCACACTGCTCGGAGAATGAAGGCGCTTCTGTTGC 

TGGTCTTGCCTTGGCTCAGTCCTGCTAA.CTACATTGACAATGTGGGCAACCTGCACTTCCTGTATTCAGAACTCTGTA 

AAGGTGCCTCCCACTACGGCCTGACCAAAGATAGGAAGAGGCGCTCACAAGATGGCTGTCCAGACGGCTGTGCGAGCC 

TCACAGCCACGGCTCCCTCCCCAGAGGTTTCTGCAGCTGCCACCATCTCCTTAATGACAGACGAGCCTGGCCTAGACA 

ACCCTGCCTACGTGTCCTCGGCAGAGGACGGGCAGCCAGCAATCAGCCCAGTGGACTCTGGCCGGAGCAACCGAACTA 

GGGCACGGCCCTTTGAGAGATCCACTATTAGAAGCAGATCATTTAAAAAAATAAATCGAGCTTTGAGTGTTCTTCGAA 

GGACAAAGAGCGGGAGTGCAGTTGCCAACCATGCCGACCAGGGCAGGGAAAATTCTGAAAACACCACTGCCCCTGAAG 

TCTTTCCAAGGTTGTACCACCTGATTCCAGATGGTGAAATTACCAGCATCAAGATCAATCGAGTAGATCCCAGTGAAA 

GCCTCTCTATTAGGCTGGTGGGAGGTAGCGAAACCCCACTGGTCCATATCATTATCCAACACATTTATCGTGATGGGG 

TGAT CGCCAGAGACGGCCGGCTACTGCCAGGAGACATCATTCTAAAGGT CAACGGGAT GGAC ATCAGCAATGTCCCTC 

ACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCA 

GCAGGAACAATGGACAGGCCCCGGATGCCTACAGACCCCGAGATGACAGCTTTCATGTGATTCTCAACAAAAGTAGCC 

CCGAGGAGCAGCTTGGAATAAAACTGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCG 

GTGTGGCATATCGACATGGTCAGCTTGAGGAGAATGACCGTGTGTTAGCCATCAATGGACATGATCTTCGATATGGCA 

GCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCGTCGTGTCCCGCCAGGTTCGGCAGC 

GGAGCCCTGACATCTTTCAGGAAGCCGGCTGGAACAGCAATGGCAGCTGGTCCCCAGGGCCAGGGGAGAGGAGCAACA 

CTCCCAAGCCCCTCCATCCTACAATTACTTGTCATGAGAAGGTGGTAAATATCCAAAAAGACCCCGGTGAATCTCTCG 

GCATGACCGTCGCAGGGGGAGCATCACATAGAGAATGGGATTTGCCTATCTATGTCATCAGTGTTGAGCCCGGAGGAG 

T CAT AAGCAGAGAT GGAAGAAT AAAAACAGGT GACAT TTT GT TGAAT GT GGAT GG GGT C GAACT GACAGAGGT CAGC C 

GGAGT GAGGCAGTGGCATT AT T GAAAAGAACATCAT CCT CGATAGTACT CAAAGCT TTG GAAGT CAAAGAGTAT GAGC 

CCCAGGAAGACTGCAGCAGCCCAGCAGCCCTGGACTCCAACCACAACATGGCCCCACCCAGTGACTGGTCCCCATCCT 

GGGT CATGT GGCT GGAATTACCACGGT GCTT GTATAACTGTAAAGATAT T GT AT TACGAAGAAACACAG CT GGAAGT C 

TGGGCTTCTGCATTGTAGGAGGTTATGAAGAATACAATGGAAACAAACCTTTTTTCATCAAATCCATTGTTGAAGGAA 

CACCAGCAT ACAAT GAT GGAAGAATTAGAT GT GGTGATATT CTT CTT GCTGT CAATGGT AGAAGTACAT CAGGAATGA 
TACATGCTTGCTTGGCAAGACTGCTGAAAGAACTTAAAGGAAGAATTACTCTAACTATTGTTTCTTGGCCTGGCACTT 
TT TTATAGAATCAAT GATGGGT CAGAGGAAAACAGAAAAAT CACAAATAGGCTAAGAAG TT GAAACACT ATATTTAT C 
TTGT CAGTTTTTATATTTAAAGAAAGAATACATT GTAAAAAT GT CAGGAAAAGTAT GAT CAT CT AAT GAAAGCCAGTT 
ACACCTCAGAAAATATGATTCCAAAAAAATTAAAACTACTAGTTTTTTTTCAGTGTGGAGGATTTCTCATTACTCTAC 
AACATTGTTTATATTTTTTCTATTCAATAAAAAGCCCTAAAACAACTAAAATGATTGATTTGTATACCCCACTGAATT 
CAAGCT GAT TTAAATTT AAAATTT GGTATATGCT GAAGT CTGCCAAGGGTACATTATGGC CATT T TT AATTTACAGCT 
AAAATATTTTTTAAAATGCATTGCTGAGAAACGTTGCTTTCATCAAACAAGAATAAATATTTTTCAGAAGTTAAA 



FIGURE AO 



MKALLLLVLPWLS PANYI DNVGNLHFLYSELCKGASHYGLTKDRKRRS QDGCPDGCASLTATAP S 
PEVSAAATISLMTDEPGLDNPAYVSSAEDGQPAISPVDSGRSNRTRARPFERSTIRSRS FKKINR 
AL S VLRRT KS GS AVANHADQGRENS ENT T APE VFPRLYHL I PDGE I T S I K I NRVDP SE S LSI RLV 
GGSETPLVHIIIQHIYRDGVIARDGRLLPGDIILKTOGMDISWPHNYAVRLLRQPCQVLWLTVM 
REQK FRSRNNGQAPDAYRPRDDS FHVI LNKS S PEEQLG I KLVRKVDE PGVF I FNVLDGGVAYRHG 
QLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRSPDIFQEAGWNSNGSWSPG 
PGERSNTPKPLHPTITCHEKWNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR 
I KTGDI LLNVDGVELTEVSRSEAVALLKRTS S S I VLKALE VKE YEPQE DC S S PAALDSNHNMAPP 
SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 
RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

Important features: 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites. 

amino acids 108-112, 157-161, 289-293, 384-388 

Tyrosine kinase phosphorylation sites . 

amino acids 433-441, 492-500 

N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE 41 



ACCAGGCATTGTATCTTCAGTTGTCATCAAGTTCGCAATCAGATTGGAAAA.GCTCAACTTGAAGCTTT 
CTTGCCTGCAGTGAAGCAGAGAGATAGATATTATTCACGTAATAAAAAACATGGGCTTCAACCTGACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAGCAAAGGAGTTCATGGCTAATTTCC 
ATAAGACC C T CAT T T T GGG GAAG GGAAAAAC T C T G AC T AAT GAAG CAT C CAC G AAGAAG GT AGAACT T 
GACAACT GTCCTTCTGT GT CT CCT T ACCT CAGAGGCCAGAGCAAGCT CATT T T CAAACCAGATCTCAC 
TTTGGAAGAGGTACAGGCAGAAAATCCCAAAGT GTC CAGAGGCCGGT AT CGCC CT CAGGAATGTAAA.G 
CTT T ACAGAGGGTCGC C AT CCTCGT T C CC C ACCGGAACAGAGAGAAACACCTGATGTACCT GCT GGAA 
CATCTGCATCCCTTCCTGCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGG 

TAAAA&GT TTAATCGAGCCAAACT CTT GAATGT GGGC TAT CTAGAAGCCCT CAAGGAAGAAAATTGGG 
ACTGCTTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGAG 
CATCCCAAGCATCTGGTGGTTGGCAGGAACAGCACTGGGTACAGGTTACGTTACAGTGGATATTTTGG 
GGGTGTTACTGCCCTAAGCAGAGAGCAGTTTTTCAAGGTGAATGGATTCTCTAACAACTACTGGGGAT 
GGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAATGAAAATTTCCCGGCCCCTG 
CCTGAAGT GGGTAAATATACAAT GGTCTTCCACACTAGAGACAAAGGCAATGAGGTGAACGCAGAACG 
GATGAAGCTCTTACACCAAGTGTCACGAGTCTGGAGAACAGATGGGTTGAGTAGTTGTTCTTATAAAT 
TAGTATCT GTGGAACACAATCCT T TATAT ATCAACATCACAGT GGAT T TCT GGTTT GGTGCATGACCC 
TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA 
AT AGT AGCACACATTAAGAAC CT GT T ACAGC T CAT T G T T GAGC T GAAT TTTTCCTTTTTG TAT T T T C T 
T AGCAGAGCT CCTGGT GAT GT AGAGTAT AAAACAGT TGTAACAAGAC AGCT TT CTTAGTCATTT TGAT 
CATGAGGGTTAAATATTGTAATATGGATACTTGAAGGACTTTATATAAAAGGATGACTCAAAGGATAA 
AAT GAACGCTAT T TGAGGACT CT GGT T GAAGGAGAT T TAT TTAAATT TGAAGT AATATAT TATGGGAT 
AAAAGGCCACAGGAAATAAGACT GC TGAATGT CTGAGAGAACCAGAGT T GT TCT CGT CCAAGGT AGAA 
AGGTACGAAGATACAATACTGTTAT TCAT TTATCCTGTACAAT CATCTGTGAAGTGGTGGTGTCAGGT 
GAGAAGGC GTC CACAAAA.GAG GGGAGAAAAGG C GACGAAT C AGGACAC AGT GAA.C T T GGGAAT G AAGA 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 

CTT C AGGG GAGGAC C T GCC C AGG T AT GC C TT C C AG T GAT G C CC AC CAGAGAAT ACAT T C T C TAT TAGT 
TTTTAAAGAGTTTTTGTAAAATGATTTTGTACAAGTAGGATATGAATTAGCAGTTTACAAGTTTACAT 

AT T AAC TAAT AAT AAATAT G T CT AT CAAATAC C T CT GTAG T AAAATGT G AAAAAGCAAAA 



FIGURE 42 



MGFNLTFHLSYKFRLLLLLTLCLTWGWATSNYFVGAIQEIPKAKEFKANFHKTLILGKGKTLTN 
EASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAILVPH 
RNREKHIMYLLEHLHPFLQRQQLDYGIYVIHQAEGKKFNRAKLLNVGYLEALKEENWDCFIFHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVKGFSNNYWGWGGED 
DDLRLRVELQRKKISRPLPEVGKYTMVFHTRDKGNEWAERMKLLHQVSRVWRTDGLSSCSYKLV 

SVEHNPLYINITVDFWFGA 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 4-8, 220-224, 335-339 

Xylose isomerase proteins. 

amino acids 191-202 
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FIGURE 43 

GCTCAAGACCCAGCAGTGGGACAGCCAGACAGACGGCACGATGGCACT GAGC TCCCAGATCTGGG 

CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 

ACGGGACAACTT GCAGAGC TGCAACCCCAGGACAGAGCT GGAGCCAGGGC CAGC T GGAT GCCCAT 

GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 

ATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTGCCCCCGTCCCCTCCG 

TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

TCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 44 

MALSSQIWAACLLLLLLLASLTSGSVFPQQTGQLAELQPQDRAGARASWMPMFQRRRRRDTHFPI 
C I FCCGCCHRSKCGMCCKT 



Important features: 
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP -dependent protein kinase phosphorylation site. 

amino acids 58-59 




pi 



N-myristoylation site. 



amino acids 4 4-50 



in 



Prokaryotic membrane lipoprotein lipid attachment site. 




amino acids 1-12 
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FIGURE 45 

GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAATATGGCTGGTTCCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACC TT CAACAC AAC C CC T CTTGTCACCATACAGCCAGAAGGGGGCAC TAT CATAGTGACC CAAA 
AT CGTAATAGGGAGAGAGTAGAC T TCCCAGATGGAGGC TACT CC C TGAAGC T CAGCAAACT GAAG 
AAGAAT GAC TCAGGGAT CT AC TATGT GGGGATATACAGCTCATCACTCCAGCAGCCCT C CAC CCA 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
ATAAGAATGGCACCTGTGTGACCAATCTGACATGCTGCATGGAACATGGGGAAGAGGATGTGATT 
TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
CTGGAGATGGGGAGAAAGTGATATGACCTTCATCTGCGTTGCCAGGAACCCTGTCAGCAGAAACT 
TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
GT T T C T GAAGAGAGAGAGACAAGAAGAG T AC AT TGAAGAGAAGAAGAG AG TGGAC AT TTGT CGGG 
AAACT C C TAAC AT ATGCCC CCAT T CT GGAGAGAACACAGAGTACGACACAAT CCCT CACAC T AAT 
AGAACAATCCTAAAGGAAGATCCAGCAAATACGGTTTACTCCACTGTGGAAATACCGAAAAAGAT 
GGAAAATCCCCACTCACTGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAGAATGTTA 
TCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCA 
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FIGURE 46 



MAG SPTCLTLIYI LWQL T G S AAS G PVKE LVGS VGGAVT FPLKS KVKQVDS I VWT FNT TPLVT I QP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKMjGQAMIESHNGSILPISWRWGESDMTFIC 
VARNPVSRNFSSPILARKLCEGAADDPDSSMVLLCLLLVPLLLSLFVLGLFLWFLKRERQEEYIE 
EKKRVDICRETPNICPHSGENTEYDTIPHTNRTILKEDPANTVYSTVEIPKKMENPHSLLTMPDT 

PRLFAYENVI 

Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 224-250 

Leucine zipper pattern. 

amino acids 229-251 

N-glycosylation sites. 

amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 
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FIGURE 47 



GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAATGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 

TAGT T GAGGAAGAC CAAT T T TC T C AAAAC CC C AT CTCTTGCTT TGAG T G GT GGT TCCC AGGAAT T 
ATAGGAGCAGGTCTGATGGCCAT TCCAGCAACAACAATG TCCT TGACAGCAAGAAAAAGAGCGTG 
C TGCAACAACAGAAC T GGAAT GT T TC T T T CAT CAT T T TT CAGT GTGAT CACAGTCAT TGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 

AACAGTAAT GCCAAT T G TGAAT T T T CAT T GAAAAACAT CAGT GACAT T CAT CCAGAAT CCT T CAA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
ACACCAT GGCGAGTGGCTGGAGAGCAT C TAGT T TCCACT T CGAT T CT GAAGAAAACAAACATAGG 
CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
TG TAGT TT AAT GGGAAT A&AATGTAAG TAT CAGTAGT T TGAAAAAAAAAAA 
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FIGURE 48 

MTCCEGWTSCNGFSLLVLLLLGWLNAIPLIVSLVEEDQFSQNPISCFEWWFPGIIGAGLMAIPA' 
TTMSLTARKRACCNNRTGMFLSSFFSVITVIGALYCMLISIQALLKGPLMCNSPSNSNANCEFSL 
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRLIHFSVFLGL 
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features: 
Transmembrane domains: 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 
N-glycosylation sites. 

amino acids 80-84, 132-136, 148-152, 163-167 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 223-227 

N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE AQ 



ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGTGAGGAACCTGCAGAGTCAGACAGTTGC TGACCAAT TGAGCT GT GAGCCTGGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CATTTGCCAATAAAGACGATCCCTTCTACTATGACTGGAAAAACCTGCAGCTGAGCGGACTGATC 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
CAGCCAGAAGCAGCACAGTCCTGTACCTGAGAAGGCCATCCCACTCATCACTCCAGGCTCTGCCA 
CTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGAAGCCTAACACTGGCCCCCAGCACC 
TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 
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FIGURE- so 

MERVTLALLLLAGLTALE^DPFANKDDPFYYDWBO^LQLSGLICGGLIAIAGIMVLSGKCKYKS 
S QKQHS PVPEKAI PL ITPGSAT T C 

Important features: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 36-59 

N-myristoylation sites. 

amino acids 41-47, 45-51, 84-90 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 

amino acids 54-67 
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FIGURE si 

GTGGACTCTGAGAAGCCCAGGCAGTTGAGGACAGGAGAGAGAAGGCTGCAGACCCAGAGGGAGGG 

AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 

AGGAGACCCTGGTGGGAGGAAGACACTCTGGAGAGAGAGGGGGCTGGGCAGAGATGAAGTTCCAG 

GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 

CGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCCTGGGAGACGCCCTGA 

GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 

GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 

AGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATGCTCTGGGAAACACTGGGCACGAGA 

TTGGCAGACAGGCAGAAGATGTCATTCGACACGGAGCAGATGCTGTCCGCGGCTCCTGGCAGGGG 

GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 

CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 

GAAAC T CAGCAGGCAGCT T T GGAATGAAT CC TCAGGGAGCTCCC TGGGG TC AAGGAGGCAAT GGA 

GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 

AGC CAGCAACCAGAAT GAAG GGT GCACGAAT CC CCCAC CAT CTGGC TCAGG X GGAGGCT C CAGCA 

ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 

AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 

TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 

CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 

CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 

C T TCAGAGGACAGGGAGT T T C CAGCAACAT GAGGGAAATAAGCAAAGAG GGCAATCGCCT CC T T G 

GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 

GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 

GAATTTTAAATCCAAGCTGGGTTTCATCAACTGGGATGCCATAAACAAGGACCAGAGAAGCTCTC 

GCATCCC GTGAC C T CCAGACAAGGAGCCACCAGAT TGGATGGGAGC CCCCAC AC T C CC T CC TT AA 

AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
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FIGURE 52 



MKFQGPIACLLLALCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS 
EALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHALGNTGHE I GRQAEDVIRHGADAVRGSWQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 
ARGSGESGIQGFRGQGVSSNMREISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGVNTVNSETSPGM 
FNFDTFWKNFKSKLGFINWDAINKDQRSSRIP 

O Signal peptide: 

W amino acids 1-21 

m 

.: ; N-glycosylation site. 

= B s amino acids 265-269 

s Glycosaminoglycan attachment site. 

O amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 

^ Casein kinase II phosphorylation site. 

% amino acids 26-30, 109-113, 259-263, 300-304, 304-308 

i 

N-myristoylation site. 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE 53 

GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATGTCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 
AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 
GAAGTCCTATATAACGATCTTCAACAAGAGTGCTU^ACATCATGCTTGACAAGTGGCAGCACCTGG 
CCTCAGAGGGCAGCAGTCGTCTGGACATGTTTGAGCACATCAGCCTCATGACCTTGGACAGTCTA 
CAGAAATGCATCTTCAGCTTTGACAGCCATTGTCAGGAGAGGCCCAGTGAATATATTGCCACCAT 
C T T GGAGCT CAGTGCC CT TGTAGAGAAAAGAAGCC AG CAT AT CC TCCAGCACATGGAC T T TCT GT 
ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 
GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 
AG C C AAG T C C AAG AC T T T G GAT T T CAT T GAT GTGCTTCTGCT GAG C AAG GAT GAAG A T G G G AAG G 
CAT T GT CAGATGAGGATATAAGAGCAGAGGC TGACACC T T CATG T T TGGAG GCCATGACACCACG 
GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 
ACAGGAGGTGCAAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCC 
AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 
TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 
CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GC T GC T GCAC T T CCGGT T CC T GCCAGAC CACAC T GAGC C C C GCAGGAAGC T GGAAT T GAT CAT GC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CC AT CC ACC T GT T T T T T T GCAGAT TGT CAT G AAT AAAACGGT GC T G T CAAA 



FIGURE 

MSLLSLPWLGLRPVAMSPWLLLLLW 

LITPTEEGLKDSTQMSATYSQGFTVWLGPIIPFIVLCHPDTIRSITNASAAIAPKDNLFIRFLKP 
WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGSSRLDMFEHI 
SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 
RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAE7VDTF 
MFGGHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 
LHPPAPFI SRCCTQDIVLPDGRVIPKGITCLIDI IGVHHNPTVWPDPEVYDPFRFDPENSKGRSP 
LAF I P FS AGPRNC I GQAFAMAEMKVVLALMLLH FRFL PDHTE PRRKLE L IMRAEGGL WLRVE PLN 
VGLQ 

Important features: 
Transmembrane domains: 

amino acids 13-32 (type II) , 77-102 

Cytochrome P450 cysteine heme-iron ligand signature. 

amino acids 461-471 

N-glycosylation sites. 

amino acids 112-116, 168-172 
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FIGURE ss 

ATCGCATCAATTGGGAGTACCATCTTCCTCATGGGACCAGTGAAACAGCTGAAGCGAATGTTTGA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAAT T CAT GGCCAGT T TT ATGAAGCT TT GGAAGGCACTAT GGACAGAAGC T GGTGGACAGT T T T 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
G TGAT T C GAACAT T T GAGGG T TAC T T T T GGAAGC AACAAT AC AT TCTC GAAC C TGAATG T CAG T A 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
TCT GGAT GT T G T CC CAC T GAAT TCCCAT GAATACAAACCTAT TCAGCAACAGCAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 56 

MGPVKQLKRMFEPTRLIAT IMVLLCFALTLCSAFWWHNKGLALI FCILQSLALTWYSLS FI PFAR 
DAVKKC FAVCLA 



Important features: 
Signal peptide: 

amino acids 1-33 

j^i Type II fibronectin collagen -binding domain protein. 

O amino acids 30-72 

o 
m 

UT 

H 
III 

II! 

O . 

If! 

0 

m 
a 
rii 
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FIGURE 57 

CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAACATGCTCTGTCTGTGCCTG 
TACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTGAGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCC 
ATTT T CAAGCT CAGT GT CTTCAT C CCCT CCCAGGAATT CTC CACCTAC CGC CAGT GGAAGCAGAAAAT T GTACAAGCT 
GGAGATAAGGACCTTGATGGGCAGCTAGACTTTGAAGAATTTGTCCATTATCT CCAAGATCAT GAGAAGAAGCT GAGG 
CT GGT GTTTAAGATTTTGGACAAAAAGAATGATGGACGCATTGACGCGCAGGAGAT CATGCAGT CCCTGCGGGACTTG 
GGAGT CAAGATATCTGAACAG CAGGCAGAAAAAATTCT CAAGAGCAT GGAT AAAAAC GGCAC GAT GAC CAT C GACT GG 
AAC GAGT GGAGAGACT ACCAC GT C CT CCACCCC GTGGAAAACAT CC C CGAGAT CAT CCT CT ACT GGAAG CAT T C CAC G 
AT CT TT GAT GT GGGTGAGAAT CT AACGGT CCC GGAT GAGTT CACAGT G GAGGAGAGGCAGAC GG GGAT GTGGT GGAGA 
CACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 
C AGGT C CAT GCCT CC C GCAGCAACAACAT GGGCAT CGT T GGT GG CTT CACT CAGAT GAT T C GAGAAGGAGGGGC CAGG 
T CACT CTGG C GG GG CAAT G GCAT CAAC GT CCT CAAAAT T GC CCC CGAAT CAGCCAT CAAAT T CAT GGC C TAT GAGCAG 
AT C AAGC GC CTT GTTGGT AGT GACCAGGAGACT CTGAGGAT T CACGAGAGGCTT GT GGCAG GGT C CTT GGCAGGGGC C 
ATCGCCCAGAGCAGCATCTACCCAATGGAGGTCCTGAAGACCCGGATGGCGCTGCGGAA.GACAGGCCAGTACTCAGGA 

AT GCTGGACT GCGCCAGGAGGAT CCTGGCCAGAGAGGGG GT GGCCGCCTTCT ACAAAGGCTATGT C C CCAACAT GCT G 
GGCATCATCCCCTATGCCGGCATCGACCTTGCAGTCTACGAGACGCTCAAGAATGCCTGGCTGCAGCACTATGCAGTG 
AAC AGC G C GGACC C CGGC GT GT TT GT GCT CC T GG CCT GT GGCACCAT GT C C AGTAC CT GT GGCCAGCT GGC CAGCTAC 
CCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCTCCGGAGGTGACCATGAGCAGCCTC 
TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 
GCTGTGAGCATCAGCTACGTGGTCTACGAGAACCTGAAGATCACCCTGGGCGTGCAGTCGCGGTGACGGGGGGAGGGC 
CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 
AAGCT GT CT C GAGCCAAGCT GT GAAAACCCTAGACGCAC CCGCAGGGAGGGT GG G GAGAGCT GGC AG G C CCAGGGCTT 
GTCCTGCTGACCCCAGCAGACCCTCCTGTTGGTTCCAGCGAAGACCACAGGCATTCCTTAGGGTCCAGGGTCAGCAGG 
CTCCGGGCTCACATGTGTAAGGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCT 
TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 
CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
AT AAT CCAT GAT GAAAGGT GAGGT CAC GT G GC CTC C CAGGCCTGAC T TC CCAAC CTAC AGCAT T GACGCCAACTT GGC 
T GTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCT GATGGCT GGGGCT CTC GGGCAT G 
CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
GGCCTGGACCCTGTCAGGATGGGCCCCACCTCAGAACCAAACTCACTGTCCCCACTGTGGCATGAGGGCAGTGGAGCA 
C CAT GTT TGAGGGCGAAGGGCAGAGCGTTT GT GT GTT CT GGGGAGGGAAGGAAAAGGT GTT G GAGGC CT TAATTAT GG 
ACTGTT GGGAAAAGGGT TTT GT CCAGAAGGACAAGCC GGACAAAT GAGCGACTT CT GT GCT T CCAGAGGAAGAC GAGG 
GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 
GGGACCAGCCCCACATTCCACTTGTGTCACTGCTTGGAACCTATTTATTTTGTATTTATTTGAACAGAGTTATGTCCT 
AACTATTTT TATAGAT TT GT T TAATTAATAGCTT GT CATTT T CAAGT TCAT TTT TT ATT C AT AT TTATGTT CAT GGTT 
GATTGTACCTTCCCAAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACATCT 
GTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 
GT TGGGGAAG GGCT T GCCCC CAGC CTT AGGAT TT CAGGGT TT GACT GGG GGCGT GGAGAGAGAGGGAGGAAC CT CAAT 
AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCCATTCCACCAGAATGACCTGA 
T GAGGAAAT CT T CAATAGGAT GCAAAGAT CAATGCAAAAATT GT TATATAT GAACAT ATAACT GGAGTC GT CAAAAAG 
CAAATTAAGAAAGAAT T GGACGT TAGAAGT T GTC ATT TAAAGCAGC CTT CTAATAAAGT T GTTT CAAAGC T GAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 58 

MLCLCLYVPVIGEAQTEFQYFESKGLPAELKSIFKLSVFIPSQEFSTYRQWKQKIVQAGDKDLDG 
QLDFEE FVHYLQDHEKKLRLVFKILDKKNDGRI DAQE IMQSLRDLGVKI SEQQAEKI LKSMDKNG 
TMT I DWNEWRDYHLLHPVENI PE 1 1 LYWKHST I FDVGENLTVPDE FTVEERQTGMWWRHLVAGGG 
AGAVSRTCTAPLDRLKVLMQVfiASRSNmGIVGGFTQMIREGGARSLWRGNGINVLKIAPESAIK 
FMAYE Q I KRLVG S DQE TLR I HERLVAGSLAGAI AQS S I YPME VLKTRMALRKTGQ Y S GMLDCARR 
ILAREGVAAFYKGYVPNMLG 1 1 PYAGI DLAVYETLKNAWLQHYAVNSADPGVFVLLACGTMSSTC 
GQLAS YPLALVRTRMQAQAS IEGAPEVTMS SLFKHI LRTEGAFGL YRGLAPNFMKVI PAVS I S YV 
VYENLKITLGVQSR 

Important features: 
Signal peptide: 

amino acids 1-16 

Putative transmembrane domains : 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 

N-glycosy lation sites . 

amino acids 129-133, 169-173 

Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE f?Q 

GGAAG GC AGC G G C AG C T C CAC T C AG C C AG T AC C GAG AT AC GC T G GGAAC C T T C C C CAG C CATGGC 
TTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTG 
CACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTCGCCTCAGCT 
GGGAACAT TGGGGAGGATGGAATCCTGAGCTGCACTT TT GAACCTGACAT CAAACTTTCTGATAT 
CGTGATACAATGGCTGAAGGAAGGT GTTT TAGGC TTGGT CCATGAGT TCAAAGAAGGCAAAGATG 

AGCT GTCGGAGCAGGAT GAAAT GT TCAG AGGC CGGACAGCAGT G T T T GC T GATCAAGT GAT AGT T 
GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 
a : CATCACTTCTAAAGGCAAGGGGAATGCTAACCTTGAGTATAAAACTGGAGCCTTCAGCATGCCGG 
h AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
CI CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC 
P CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 
TCAACAACACATAC T CC TGTATGATTGAAAATGACAT T GC CAAAGCAACAGGGGATATCAAAGTG 
ACAGAATCGGAGATCAAAAGGCGGAGT CACC T ACAGC T GC TAAAC T CAAAGGCT TC TCT GT G T GT 
If! CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAAT 
R . GTGCCTTGGCCACAAAAAAGCATGCAAAGTCATTGTTACAACAGGGATCTACAGAACTATTTCAC 

O CACCAGATATGACCTAGTT T TATATT TCTGGGAGGAAATGAATTCATAT CTAGAAGTCTGGAGTG 

AGCAAACAAGAGCAAGAAACAAAAAGAAGCCAAAAGC AGAAGGC TC CAATAT GAACAAGAT AAAT 
C TAT C T T CAAAGACAT AT T AGAAG T TGGGAAAATAAT T CAT GT GAAC T AGACAAGT G TGT T AAGA 
"Q GTGATAAGTAAAATGCACGTGGAGAC^AGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 
fU CACC TGGGGAGT GAGAGGACAGGATAG T GCAT GT TCTTTGTCTCT GAAT T T T TAGT TATATGT GC 

TGTAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCAC 

AAATTAAGCTGTAGTAT GT ACCC TAAGACGC TGC TAAT T GACTGCCAC T T CGCAAC T CAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
TCTTCCCAACTGACAAATGCCAAAGTTGAGAAAAATGATCATAATTTTAGCATAAACAGAGCAGT 
CGGGGACACCGATTTTATAAATAAACTGAGCACCTTCTTTTTAAACAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 60 



MAS LGQ I L FWS 1 1 S 1 1 1 1 LAGAI AL 1 1 GFG I SGRHS I T VT T VAS AGN I GEDG I LS CT FE PDI KLS 
DIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKC 
YI I T SKGKGNANLE YKT GAFSMPEVNVDYNAS S E TLRCEAPRW FPQPTWWAS QVDQGANFSEVS 
NTSFELNSENVTMKVVSVLYNVTIlSn^ 
CVS SFFAI S WALLPLS PYLMLK 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 258-281 

N-glycosylation sites. 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 

N-myristoylation sites. 

amino acids 52-58, 126-132, 188-194 
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FIGURE 61 

T GACGTCAGAATCACCATGGCCAGCTAT CC TTACCGGCAGGGCTGCCCAGGAGC TGCAGGACAAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
GGTGGACTCAGATCACAGTGGCTATATCTCCATGAAGGAGCTAAAGCAGGCCCTGGTCAACTGCA 
AT TGGT CT T CAT T CAAT GAT GAGAC CT GC CT CAT GAT GAT AAAC AT G T T TGAC AAGAC C AAGTCA 
GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 
CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTCTGTCCC 
AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 
GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 
GGCCTTCCGGGAGAAGGACACAGCTGTACAAGGCAACATCCGGCTCAGCTTCGAGGACTTCGTCA 
CCATGACAGCTTCTCGGATGCTATGACCCAACCATCTGTGGAGAGTGGAGTGCACCAGGGACCTT 
TCCTGGCTTCTTAGAGTGAGAGAAGTATGTGGACATCTCTTCTTTTCCTGTCCCTCTAGAAGAAC 
ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 
AATAGTGAGGACCGGGGCTGAGGCCACACAGATAGGGGCCTGATGGAGGAGAGGATAGAAGTTGA 
ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 
AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 
GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
CTGACCAAGCATGAGAGAGATCTGTCTATGGGACCAGTGGCTTGGATTCTGCCACACCCATAAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
AT T TGGGGC CAAAAGT CCAGT GAAAT T GT AAGCT TCAAT AAAAGGATGAAAC T C T GA 
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FIGURE 62 



m 



MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
S G Y I SMKE LKQAL VNCNWS S FNDE TC LMM I NMFDKTKS GR I DVYG FS ALWK F I QQWKNL FQQ YDR 
DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 
DTAVQGMI RLS FEDFVTMTASRML 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 

N-myristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 6a 



CAGGATCCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 
TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 
AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCAGCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCGTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCT5CGGCTGAGGACCGCCTGGGGGA 
AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 
TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 
TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 
CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 
TGTCCTGGGGAGGTGGAGGCCGTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 
ATCTGGGGTATCAATAATCAACCCCCAGGTACCAGCTGGGGAAATATTAATCGGTATCCAGGAGG 
CAGCTGGGGAAATATTAATCGGTATCCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG 
GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 
CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 
GGGCTAGAGCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGCCCCTTGCTG 
TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 
TGC TCAT CT CCAATAAAATAAAAGCAC T TAT GAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
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FIGURE 64 



MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDIiARVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPAMDSWPPEDPWQMMAAAAEDRLGEA 
LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 



Important features of the protein: 
Signal peptide: 

}l\ amino acids 1-26 



m 



I: 

Ml 
PI 



Casein kinase II phosphorylation sites. 

amino acids 56-59, 155-158 

N-myri s toy la tion si tes . 

amino acids 43-53, 220-225, 221-226, 224-229, 247-252, 258-263, 
259-264, 269-274, 270-275, 280-285, 281-286, 305-310 
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FIGURE 6s 



AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTATCGGGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
. AAGAT C T CAACAC CAT GT T GTC T GCAACACATGACAGCCAT TGAAGCC TGT GTCC TTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 
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MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARSQHHWCNT 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site, 

amino acids 37-41 

N-myristoylation sites . 

amino acids 15-21, 19-25, 60-66 
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FIGURE 67 



ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATCGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 

Ish GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 

O GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
GCCTGCCCATC TAGG TCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 

f J GGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAGTGCTTAATAGCAGGGAAGAAGGTACTTCAA 
AGAC TCTGCCCCT GAGG T C AAG AGAG GAT G G GG C TAT T CAC T T T TAT AT AT T TATATAAAAT TAG 

III TAGTGAGATGTAAAAAAAAAAAAAAAAAA 
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FIGURE 68 



O amino acids 86-94 



MANPGLGLLLALGLPFLLARWGRAWGQIQTTSANENSTVLPSSTSSSSDGNLRPEAITAIIWFS 
LLAALLLAVGLALLVRKLREKRQTEGTYRPSSEEQFSHAAEARAPQDSKETVQGCLPI 

Important features: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 56-80 

N-glycosylation site. 

amino acids 36-40 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 8 6-90 

Tyrosine kinase phosphorylation site. 



N-myristoylation sites. 

amino acids 7-13, 26-32 



APP ID=1 0063735 



Page 222 of 322 



FIGURE 6q 



GCCAGGAATAACTAGAGAGGAACAATGGGGTTATTCAGAGGTTTTGTTTTCCTCTTAGTTCTGTGCCTGCTGCACCAG 
T CAAATACT TC CTT CATT AAGCT GAAT AAT AAT GGC TT T GAAGATATT GTCAT T GTTATAGAT CCT AGT GT GC CAGAA 
GAT GAAAAAATAATTGAACAAATAGAGGAT AT GGT GACT ACAGCTT CTACGTAC CT GTTTGAAGCCACAGAAAAAAGA 
TTTTTTTTCAAAAATGTATCTATATTAATTCCTGAGAATTGGAAGGAAAATCCTCAGTACAAAAGGCCAAAACATGAA 
AAC CATAAACAT GCTGAT GTT ATAGT T GCAC CAC CTACACT CCCAGGT AGAGAT GAAC CATACACCAAGCAGT TCACA 
GAAT GT GGAGAGAAAGGC GAATACAT T CACT TCACC CCT GACCTTCTACTT GGAAAAAAACAAAAT GAATAT GGACCA 
CCAGGCAAACTGTTTGTCCATGAGTGGGCTCACCTCCGGTGGGGAGTGTTTGATGAGTACAATGAAGATCAGCCTTTC 
TACC GT GCTAAGT CAAAAAAAAT CGAAG C AACAAGGT GT TC C G CAGGTATCT CT GGTAGAAATAGAGT TTATAAGT GT 
CAAGGAGGCAGCT GTCTTAGT AGAGC AT GC AGAATT GATTCTACAAC AAAACT GTAT GGAAAAGAT T GT CAATT CT T T 
CCTGATAAAGTACAAACAGAAAAAGCATCCATAATGTTTATGCAAAGTATTGATTCTGTTGTTGAATTTTGTAACGAA 
AAAACC CAT AAT CAAGAAGCT CCAAG C CTACAAAAC AT AAAGT G CAATT TTAGAAGTACAT G GGAG GT GAT T AGCAAT 
TCTGAGGATTTTAAAAACACCATACCCATGGTGACACCACCTCCTCCACCTGTCTTCTCATTGCTGAAGATCAGTCAA 
AGAATT GT GTGCTTAGTT CTT GAT AAGT CT GGAAGC AT G GGGG GTAAGGACCG CCTAAAT CGAATGAAT CAAGCAGCA 
AAACATT T CCTGCT GCAGACT GTT GAAAAT G GAT CC T GGGT GG GGAT GGTT CACTTT GATAGTACT GCCAC T ATTGTA 
AATAAGCTAAT C CAAATAAAAAGCAGT GAT GAAAGAAACAC ACT CAT G GCAG GATTAC CTACAT ATC CT CT GGGAGGA 
ACT T CCAT CT GCT C T GGAATTAAATAT GCAT T T C AGGT GATT G GAGAGC TACATTC CCAACT CGAT GGATC C GAAGTA 
£11 CTGCTGCTGACTGATGGGGAGGATAACACTGCAAGTTCTTGTATTGATGAAGTGAAACAAAGTGGGGCCATTGTTCAT 
s[ ; TTTATT GCTTT GG GAAGAGCT GCT GAT GAAG CAGTAAT AGAGAT GAG CAAGATAACAGGAGGAAGT CATTTTTATGTT 

T CAGAT GAAGCT CAGAACAAT GGC CT CAT T GAT GCT T T T GGGG G T CTTACAT CAGGAAAT ACT GATCT CT C CCAGAAG 
%l T C CCT T CAG CT C GAAAGT AAG GGAT T AAC AC T GAAT AGT AAT GC CT G GATGAAC GACACT GT CATAATT GATAGTACA 

■ "J GT GGGAAAGGAC ACGT T C TTT CT CAT CACAT GGAACAGT CT GC CTC C CAGTAT TTCT CT CT GGGAT C CCAGTGGAACA 

ATAATGGAAAATTTCACAGTGGATGCAACTT CCAAAAT GGCCTATCT CAGTAT TCCAGGAACTGCAAAGGTGGG CACT 
If! T GGGCATACAAT CT TCAAGCCAAAGC GAACC CAGAAACATT AACTATTACAGTAACTTC T CGAGCAGCAAATTCTT CT 

GTGCCT CCAAT C ACAGT GAAT GCTAAAAT GAAT AAGGAC GTAAACAGT TTCC CCAGCCCAAT GATT GTTTACGCAGAA 
ATT CTACAAGGATATGTACCT GTT CTT GGAG CCAAT GT GACT G C TT T CATT GAAT CACAGAAT GGACATACAGAAGTT 
O TTGGAACTTTTGGATAATGGTGCAGGCGCTGATTCTTTC^VAGAATGATGGAGTCTACTCCAGGTATTTTACAGCATAT 
ACAGAAAAT GGCAGATATAGCTTAAAAGT TC GGGCT CAT GGAGGAGCAAACACT GC CAGGCT AAAAT TACGGCC TC C A 
W CT GAATAGAGCC GCGTACATACCAGGCT GGGT AGT GAAC GGGGAAATT GAAG CAAACCC GCCAAGACCT GAAAT T GAT 

|;!| GAGGAT ACT CAGACCAC CTTGGAGGAT T TCAGC C GAACAGCAT CCGGAGGT GCATTT GT GGTAT CAC AAGT CCCAAGC 

'i7% CTTCCCTTGCCTGACCAATACCCACCAAGTCAAATCACAGACCTTGATGCCACAGTTCATGAGGATAAGATTATTCTT 
ACAT GGACAGCAC CAGGAGATAATTTT GATGT T G GAAAAGTT CAAC GT TAT AT CATAAGAATAAGT GCAAGT ATTCT T 
Q GATCTAAGAGACAGTTTTGATGATGCTCTTCAAGTAAATACTACTGATCTGTCACCAAAGGAGGCCAACTCCAAGGAA 
Ml AGCTTT GCATT TAAACCAGAAAATAT CT CAGAAGAAAAT GC AACCCACATATT TATT GC CATTAAAAGTATAGATAAA 

2 ^ AGCAATTT GACAT CAAAAGTATC CAACAT TG C ACAAGTAACTTT GT TTATC C CT CAAGCAAAT CCT GAT GACATTGAT 

C CT ACACCTACT C CTACT CCTACT CCT ACTC CT GAT AAAAGT CATAAT T CT G GAGTTAATATTT CTACGCT GGTATT G 
T CT GTGATT GGGT CTGTT GTAATT GTT AACT T TATT T T AAGT ACCAC CATT TGAA C CTTAAC GAAGAAAAAAAT CTT C 
AAGTAGACCTAGAAGAGAGTTTTAAAAAACAAAACAATGTAAGTAAAGGATATTTCTGAATCTTAAAATTCATCCCAT 
GT GT GAT CATAAACTCATAAAAATAAT T TTAAGAT GT CG GAAAAGGATACT TT GATT AAAT AAAAACACTCATGGAT A 
TGTAAAAACTGTC^AGATTAAAATTTAATAGTTTCATTTATTTGTTATTTTATTTGTAAGAAATAGTGATGAACAAAG 
ATCCTTTTTCATACTGATACCTGGTTGTATATTATTTGATGCAACAGTTTTCTGAAATGATATTTCAAATTGCATCAA 
GAAAT TAAAAT CAT CTAT CT GAGTAGT GAAAAT ACAAGTAAAG GAGAGCAAATAAACAAC ATT T G GAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKI IEQIEDMVTTASTYLFE 
ATEKRFFFKNVS I L I PENWKENPQYKRPKHENHKHADVI VAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGSCLSRACRI DS TTKLYGKDCQFFPDKVQTEKAS IMFMQS I DSWEFCNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWVGMVHFDSTATIWKLIQIKSSDERNTLMAGLPTYPLGGTSIGSGIK 
YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
SH FYVS DE AQNNGL I DAFGALT S GNT DLSQKS LQLE S KGLTLNSNAWMNDT VI IDS TVGKDT FFL 
ITWNSLPPSISLWDPSGTIMENFTVDATSKMAYLSIPGTAKVGTWAYNLQAKANPETLTITVTSR 
AANS SVPP I TVNAKMNKDVNS FPS PMI VYAE I LQG YVPVLGANVTAFIESQNGHTEVLELLDNGA 
GADSFJCNDGVYSRYFTAYTENGRYSLKVRAHGGANTARLKLRPPLNRAAYIPGWVVNGEIEANPP 
RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
FDVGKVQRYIIRISASILDLRDSFDDALQVNTTDLSPKEANSKESFAFKPENISEENATHIFIAI 
KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 
VNFILSTTI 

Signal peptide: 

amino acids 1-21 

Putative transmembrane domains; 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 

N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
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FIGURE 71 



CTCCTTAGGTGGAAACCCTGGGAGTAGAGTACTGACAGCAAAGACCGGGAAAGACCATACGTCCCCGGGCAGGGGTGA 
CAACAGGTGTCATCTTTTTGATCTCGTGTGTGGCTGCCTTCCTATTTCAAGGAAAGACGCCAAGGTAATTTTGACCCA 
GAGGAGCAAT GAT GTAGCCAC C T CCTAACCT T CC CTTCTT GAAC CCCCAGTT AT GC CAGGATTTACT AGAGAGT GT CA 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GTGCT CAGAAGTGACT T G C CT GAGGGT GGAC CAGAAGAAAGGAAAGGT C CC CT CTT GCT GT T G GCT G CACAT CAGGAA 
GGCTGTGATGGGAATGAAGGTGAAAACTTGGAGATTTCACTTCAGTCATTGCTTCTGCCTGCAAGATCATCCTTTAAA 
AGT AGAGAAGCT GCTCT GT GT GGT GGTTAACT C CAAGAG GCAGAACT CGTT CTAGAAGGAAAT GGAT GC AAGCAGCT C 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAG 
CTG GCACT GCCCAG GGC CAACAGC CCCAC GG GGAAGGAG GGGTAC CAG GCC GT CCT T CAGGAGT GG GAGGAGCAGCAC 
CGCAACTACGT GAG CAG CCTGAAGCGGCAGATCG CACAGCT CAAGGAGGAG C T GCAGGAGAGGAGT GAG CAGCT CAGG 

14' AATGGGCAGTACCAAGCCAGCGATGCTGCTGGCCTGGGTCTGGACAGGAGCCCCCCAGAGAAAACCCAGGCCGACCTC 
CTGGCCTT CCT GCACT C GCAGGT GGACAAGGCAGAGGT GAAT G CT GG CGT CAAG CT GGCCACAGAGTAT GCAGCAGT G 

5f CCTTTCGATAGCTTTACTCTACAGAAGGTGTACCAGCTGGAGACTGGCCTTACCCGCCACCCCGAGGAGAAGCCTGTG 

0 AGGAAGGACAAGC GGGAT GAGTT GGT G GAAGCCATT GAAT C AGCCTT GGAGACC CT GAACAAT CCT GCAGAGAACAGC 
ft% CCCAAT CACCGT C CTTACACGGC CTCT GATT TCAT AGAAGGGAT CT ACCGAACAGAAAGGGACAAAGGGACATT GTAT 

■ GAGCTCACCTTCAAAGGGGACCACAAACACGAATTCAAACGGCTCATCTTATTTCGACCATTCAGCCCCATCATGAAA 
Uj GTGAAAAATGAAAAGCTCAACATGGCCAACACGCTTATCAATGTTATCGTGCCTCTAGC^ 

^ J C GGCAGTT CAT GCAGAATTTCAGGGAGAT GT GCATT GAGCAG GAT GGGAGAGT C CAT CT CACTGTT GTTTACTT T GGG 

AAAGAAGAAATAAAT GAAGTCAAAGGAAT ACTT GAAAACACT T CCAAAGCT GCCAACTT CAGGAACTT TACCTT CAT C 
UJ CAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTTCTC 

1 p TTTTTC T GT GAT GT GGACATCTACTT CACAT CT GAAT.TC CT CAAT AC GT GTAGGCT GAATACACAGCCAGGGAAGAAG 

GTATTTTATCCAGTTCTTTTCAGTC^GTACAATCCTGGCATAATATACGGCCACCATGATGCAGTCCCTCCCTTGGAA 
iS CAGCAGCT GGT CAT AAAGAAGGAAACT GGAT TT T GGAGAGACT TT GGATTT GGGAT GACGT GT CAGTAT C GGTCAGAC 

f»| TTCATCAATATAGGTGGGTTTGATCTGGACATCAAAGGCTGGGGCGGAGAGGATGTGCACCTTTATCGCAAGTATCTC 
?;! CACAGCAACCTCATAGTGGTACGGACGCCTGTGCGAGGACT CTTCCACCTCTGGCATGAGAAGCGCTGCATGGACGAG 

y i C T GACC CC CGAGCAGTACAAGAT GTGCAT GCAGT C C AAGGC CAT GAAC GAGGCAT C C CAC GGCCAGCT GGGCAT GCT G 

p|. GTGTTC AGGCAC GAGAT AGAGG CT CACCT TC GCAAACAGAAACAGAAGACAAGTAGCAAAAAAACATGAACT CCCAGA 

GAAGGATTGTGGGAGACACTTTTTCTTTCCTTTTGCAATTACTGAAAGTGGCTGCAACAGAGAAAAGACTTCCATAAA 
B| GGAC GACAAAAGAATT GGACT GAT GGGT CAGAGATGAGAAAG C CT C C GATT T CT CT CT GTT GGGCTTTTTACAAC AGA 

■m AAT CAAAAT CT C C GCTT T G CCT GCAAAAGTAAC C CAGT T GCAC CCT GT GAAGT GT CT GACAAAGGCAGAAT GCT T GT G 

- 31.,! 

AGATTATAAGCCTAAT GGT GT G GAGGT T T TGAT GGT GT TT ACAATACACT GAGACCT GT T GTTTT GT GT GCT CATT GA 
fll AATATT CATGATT TAAGAGCAGT T TT GTAAAAAATT CATT AG CAT GAAAGGC AAGC ATATT T CT C CT CAT AT GAAT G A 

GCCTAT CAGCAGGGCT CTAGTTTCTAGGAAT GCT AAAAT AT CAGAAGGCAGGAGAG GAGAT AGGCTTAT TAT GAT ACT 
AGTGAGT ACATTAAGTAAAATAAAAT GGAC CAGAAAAGAAAAGAAAC CATAAATAT C GT GT CATATTTT C C C CAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
T TTTTT CC CTT GT GAGT TATAGT CTGCTTATTTAATTACCAC T TT GCAAGCCT TACAAGAGAGCACAAGT T GGCCT AC 
ATTTTTATATTTTTTAAGAAGATACTTTGAGATGCATTATGAGAACTTTCAGTTCAAAGCATCAAATTGATGCCATAT 
CCAAGGACAT GC CAAAT GCTGAT T CT GT CAGG CACT GAAT GT CAGG C ATTGAGACATAGGGAAGGAATGGTT TGTACT 
AATACAGACGTACAGATACTTT CT CT GAAGAGTATTT TC GAAGAGGAGC AACT GAACACTGGAGGAAAAGAAAAT GAC 
ACTTTCT GCTTTACAGAAAAGGAAACT CATT CAGACT GGT GATATC GT GAT GT ACCTAAAAGT CAGAAACCACATTT T 
CTCCTCAGAAGTAGGGACCGCTTTCTTACCTGTTTAAATAAACCAAAGTATACCGTGTGAACCAAACAATCTCTTTTC 
AAAACAG GGTGC T CCT C CT GGCT T CT GGCTT CCATAAGAAGAAATG GAGAAAAATATATATATATATAT ATATATT GT 
GAAAGATCAATCCATCTGCCAGAATCTAGTGGGATGGAAGTTTTTGCTACATGTTATCCACCCCAGGCCAGGTGGAAG 
TAACT GAATT AT T T TT TAAATTAAGCAGTT C TAC TCAAT CAC CAAGAT GCT T CT GAAAATT GCATT TTATTACCATT T 
CAAACTATTTTT TAAAAATAAATACAGTT AACATAGAGT GGT TT CT T CATT CAT GT GAAAATTAT TAGC CAGCAC CAG 
ATGCATGAGCTAATTATCTCTTTGAGTCCTTGCTTCTGTTTGCTCACAGTAAACTCATTGTTTAAAAGCTTCAAGAAC 
ATTCAAG C T GTT GGTGT GTTAAAAAATGCAT TGTATT GATTT GTACT GGTAGT T TAT GAAATTTAATTAAAACACAGG 
CCAT GAAT GGAAGGTGGTATT GCACAGCTAAT AAAATAT GAT T T GT GGATAT GAA 
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MMMVRRGLLAWISRVWLLVLLCCAISVLYMLACTPKGDEEQLALPRANSPTGKEGYQAVLQEWE 
EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 
AEWAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPA 
ENS PNHRP YTAS DFI E G I YRTERDKGT LYE LT FKGDHKHE FKRL I L FRPFS P IMKVKNE KLNMAN 
TLINVIVPIAKRVDKFRQFMQNFREMCIEQDGRVHLTWYFGKEEINEVKGILENTSKAANFRNF 
T FI QLNGE FSRGKGLDVGARFWKGSNVLLFFCDVD I YFTSEFLNTCRLNTQPGKKVFYPVLFSQY 
NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 
KYLHSNLIVWTPTOGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGM^ 
RKQKQKTS SKKT 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 315-319/ 324-328 

N-rnyristoylation sites. 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 

Amidation site. 

amino acids 377-381 
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FIGURE 73 

GAGAC TGCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGATT TGTCCTGGGGATCCA 
GAAACCCATGATACCCTACTGAACACCGAATCCCCTGGAAGCCCACAGAGACAGAGACAGCAAGA 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 
GGAAACAATGCCCAGTCGCCCATCGATATTCAGACAGACAGTGTGACATTTGACCCTGATTTGCC 
TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 
ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 
GCCCAGCTCCACCTGCACTGGGGTCAGAAAGGATCCCCAGGGGGGTCAGAACACCAGATCAACAG 
TGAAGCCACATTTGCAGAGCTCCACATTGTACATTATGACTCTGATTCCTATGACAGCTTGAGTG 
AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 
AAT AT AG C T T AT G AAC AC AT T C T GAG T C AC T T GC AT GAAG T CAG G C AT AAAG AT C AGAAG AC C T C 
AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 
GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 
AT TTCAATGGAACAGCTGGAAAAGCTTCAGGGGACAT TGTTC T CCACAGAAGAGGAGCCCTCTAA 
GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 
TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 
CC G AAAGAGT G T GGT CT TCAC C TCAGCACAAGCC AC GAC TGAGGC ATAAAT T CC T T C T CAGATAC 
CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACACTGTAGGAGTAGTAAGCAGATGTCCTCCTTCCCCTGGACATCTCTTAGAGAGGAAT 
GGAC C C AGGCT GT CAT T C C AGGAAGAAC T GCAGAGCC T T CAGCCT C T CC AAACAT GTAGGAGGAA 
ATGAGGAAATCGCTGTGTTGTTAATGCAGAGANCAAACTCTGTTTAGTTGCAGGGGAAGTTTGGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
C T C C T T AGGAT AAAGAG TTGCTGTT G AAGT T G TAT AT T T T T GAT CAAT AT AT T T G G AAAT T AAAG 
TTTCTGACTTT 
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MLFSALLLEVIWILAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAA.QLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFAS F I QAGS S YT T GEMLSLGVG I LVGCLCLLLAVY FI ARKI RKKRLENRKS 
WFTSAQATTEA 

Important features of the protein: 
Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 291-310 

N-glycosylation site. 

amino acids 213-216 

Eukaryotic-type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 
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TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
TCTGGTGGTTTGCCTAAACCTGCAAACATCACCTTCTTATCCATCAACATGAAGAATGTCCTACA 
AT GGAC T C C ACC AGAGGGT C T TC AAGGAGT T AAAGT TAC T TACACT GT GC AG TAT T T CAT CACAA 
ATTGGCCCACCAGAGGTGGCACTGACTACAGATGAGAAGTCCATTTCTGTTGTCCTGACAGCTCC 
AGAGAAGT GGAAGAGAAAT CCAGAAGAC CTTCCTGTT TCCAT GCAACAAAT ATAC TC CAAT C T GA 
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
■fM AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 
p CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCG'TGTTTCTTTTT 
EP T C T G T GAT GG GC TAT T C CAT C TACCGAT AT ATCCAC GT T GGCAAAGAGAAAC ACCCAGCAAAT T T 

GATTTTGATTTATGGAAATGAATTTGACAAAAGATTCTTTGTGCCTGCTGAAAAAATCGTGATTA 
s ]j AC T T TAT C AC C C T CAAT AT C TCGG AT GAT T C T AAAAT T T C T CAT C AGGAT AT GAG T T TAC T GG G A 

U| AAAAG CAGT GATGTAT CCAGCCT TAAT GAT CC TCAGCC C AGCGGGAAC C T GAGGCCCCCTCAGGA 

g GGAAGAGGAGGTGAAACATTTAGGGTATGCTTCGCATTTGATGGAAATTTTTTGTGACTCTGAAG 

P AAAACACGGAAGGTACTTCTCTCACCCAGCAAGAGTCCCTCAGCAGAACAATACCCCCGGATAAA 

s ir! 

jjj ACAGTCATTGAATATGAATATGATGTCAGAACCACTGACATTTGTGCGGGGCCTGAAGAGCAGGA 
S GC TCAG TT T GCAGGAGGAGGTGT CCACACAAGGAACAT TAT TGGAGTC GCAGGCAGCGT TGGCAG 

f| TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 
f|| CAGGAGCACACAGACTCGGAGGAGGGGCCGGAGGAAGAGCCATCGACGACCCTGGTCGACTGGGA 
TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 
CCAGACAGGCCACCAGGAGAAAATGAAACCTATCTCATGCAATTCATGGAGGAATGGGGGTTATA 
TGTGCAGATGGAAAAC TGA TGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 
TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAGTCAATAAATGTTTGCCAGACTGGGTGCAGAATTTATTCAGGTGGGTGT 
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MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISVVLTAPEKWKRNPEDLPVSMQQIY 
SNLKYNVSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK 
DQS SE FKAKI I FWYVLP I S I TVFLFSVMGYS I YRY IHVGKEKHPANLI LI YGNE FDKRFFVPAEK 
IVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 



Q Important features: 



Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 



GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAG ATGG CCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 
ACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGCTCAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
AT CT GGCT GAAGG TCATCACAGCTAACAT CC TC CAGC TGCAGGT GAAG C CC T C GGC CAAT GACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
JlJ T CGT GGAGT T CCACATGACGACT GAGGCCCAAGCCAC CATC CGCAT GGAC AC CAGTGCAAGT GGC 

jS CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
|p TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
W CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 
CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
7k TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
s AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 
Q TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
IJ1 ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
1*1 TGAT CAAT GAAAAGGC TGCAGAT AAGCTGGGATCTACCCAGAT CGTGAAGAT CC TAACTCAGGAC 

2~ ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
Ffi TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
TTTACACCAAAGGTGACCAACTTATACTCAACTTGAATAACATCAGCTCTGATCGGATCCAGCTG 
ATGAACTCTGGGATTGGCTGGTTCCAACCTGATGTTCTGAAAAACATCATCACTGAGATCATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
G T CC CAGC TGGGAGTATGGGTGT GAGC T C TATAGACCATCC CT C T C T GCAAT CAAT AAACAC T TG 
CCTGTGAAAAA 
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FIGURE 78 



MAGPWTFTLLCGLLAATLIQATLSPTAVLILGPKVIKEKLTQELKDHNATSILQQLPLLSAMREK 
PAGGIPVLGSLWTVLKHIIWLKVITANILQLQVKPSANDQSLLVKIPLDMVAGFNTPLVKTIVE 
FHMTTEAQATIRMDTSASGPTRLVLSDCATSHGSLRIQLLYKLSFLWAIAKQVMNLLVPSLPNL 
VKNQLCPVIEAS FNGMYADLLQLVKVPISLSIDRLEFDLLYPAIKGDTIQLYLGAKLLDSQGKVT 
KWFl^SAASLTMPTLDNIPFSLIVSQDVVKAAVAAVLSPEEFMVLLDSVLPESAHRLKSSIGLIN 
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQL I LNLNN I S S DR I QLMNS G I GW FQPDVLKN IITEIIHSI LLPNQNGKLRS GVPVS L VKAL G 
FEAAESSLTKDALVLTPASLWKPSSPVSQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 48-51, 264-267, 401-404 

Glycosaminoglycan attachment site. 

amino acids 412-415 

LBP / BPI / CETP family proteins. 

amino acids 407-457 



APP 



ID- 



10063735 
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GAGAGAAGT CAGCCTGGCAGAGAGACTCTGAAATGAGGGAT TAGAGGTGTT CAAGGAGCAAGAGC 
TTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGCCATGGCCTCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CCAAGGGC C T C TGGAT GGAATGT GCCACACACAGCACAGGCAT CACCCAGTG T GACATC TAT AGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 
ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG 
CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 
CAAGCC CAAC CT C T TGCCACAAGGAGC T CT C CAAGGCCTGGT CAACC T CCCAAAG T CAAGAGT GA 
GT TCAAT T C C TACAGC CTGACAGGGTAT GT GTGAAGAAC CAGGGGCCAGAGCTGGGGGGT GGC T G 
GGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGT 
GT CAGAAGG T GCTGC T GAGGAT AGACT GACT T TGGCCAT TGGAT T GAGCAAAGGCAGAAATGGGG 
GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 
TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 
GGAC T C AGAGGAT CCCTTTGCCCTCTGGTT T ACC T GGGAC T CCAT CC CC AAAC CC AC TAAT C AC A 
TCCCACTGACTGACCCTCTGTGATCA?yiGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 
GACTAATTTGTGCATGAACTGAAATAAAACCATCCTACGGTATCCAGGGAACAGAAAGCAGGATG 
CAG GAT GGGAGGACAGGAAGGCAGCCT GGGACAT T TAAAAAAATA 
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FIGURE 80 

MASLGLQLVGY I LGLLGLLGTLVAMLLP SWKTS S YVGAS I VTAVGFSKGLWMECATHS TG I TQCD 
I YS T LLGLPAD I QAAQAMMVT S S AI SS LAC 1 1 S WGMRCT VFCQE SRAKDRVAVAGGVFFI LGGL 
LGFI PVAWNLHG I LRDFYS PLVPDSMKFE I GEAL YLG 1 1 S S L FSL LAG 1 1 LG FSC S S QRNRSN Y Y 
DAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 



Important features of the protein: 
Signal peptide: 

amino acids 1-24 



Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 



m N-glycosylation site. 

» amino acids 190-193 



o 
ru 



PMP-22 / EMP / MP20 family proteins. 

amino acids 4 6-59 
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FIGURE 81 



CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACC'TCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTGTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 
ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 
CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACA.TGTTCTGCCTTTT 
CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 
ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 
O GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 
fl TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 
Tl AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 
|s| ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 
HI ACT GC CAGACTCC TGC T GCCAAGC CTGCAAAGAT GAGGCAAGTGAGCAAT CGGAT GAAGAGGACA 

GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 
AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 
CAGACCCAAGGGAGCAGGCAGCACAACT GT CAAGATC GTCC TGAAGGAGAAAC ATAAGAAAGCCT 
GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 
CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 
ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 
CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
CT CGGACT TGGTGGAGATC TACC T CT GGAAGC TGGTAAAAGATGAGGAAACTGAGGC TCAGAGAG 
GTGAAGTACCTGGCCCAAGGCCACACAGCCAGAATCTTCCACTTGACTCAGATCAAGAAAGTCAG 
GAAGCAAGACTTCCAGAAAGAGGCACAGCACTTCCGACTGCTCGCTGGCCCCCACGAAGGTCACT 
GGAACGTCTTCCTAGCCCAGACCCTGGAGCTGAAGGTCACGGCCAGTCCAGACAAAGTGACCAAG 
• AC AT AACAAAGAC C TAACAG T T GCAGAT AT G AGC T G T A TAAT TGT T G T TAT TATATA T T AAT AAA 
T AAG AAG T T G CA T T AC C C T CAAAAAAAAAAAAAAAAAAAAAA 



a 
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FIGURE 82 



MVPEVRVLSSLLGI^LWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE 
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILGTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
I S S TRCPKAPGRVLVHT S VS PS PDNLRRFALEHEAS DLVE I YLWKLVKDEE TEAQRGE VPGPRPH 
SQNLPLDSDQESQEARLPERGTAIiPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide : 

amino acids 1-25 
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FIGURE 83 



GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TCCGTTTCCATGCCGTGAGGTCCATTCACAGAACACATCCATGGCTCTCATGCTCAGTTTGGTTCTGAGTC 
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 
GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
CCAGTTCTCTAGCGTGGTCCACCTCTACAGGGACGGGAAGGACCAGCCATTTATGCAGATGCCACAGTATC 
AAGGC AGG AC AAAACT GGT GAAGG AT TCT ATT GC GG AGGGGCGC AT CT C T CT GAGGCT GG AAAAC ATTACT 
GTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCATCTGGGAGCT 
O ACAGGTGTCAGCACTGGGCTCAGTTCCTCTCATTTCCATCACGGGATATGTTGATAGAGACATCCAGCTAC 
W TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 
S f J AC AGACTCC AGGAC AAAC AG AGAC AT GC AT GGCC T GTT T GAT GT GGAGATCT CT C T GACC GTC C AAGAGAA 

^5 CGCCGGGAGCATATCCTGTTCCATGCGGCATGCTCATCTGAGCCGAGAGGTGGAATCCAGGGTACAGATAG 
t*l GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 
iip TTTTTTGGCATTGTTGGACTGAAGATTTTCTTCTCCAAATTCCAGTGGAAAATCCAGGCGGAACTGGACTG 
a GAGAAGAAAGCACGGACAGGCAGAATTGAGAGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAG 
jp AGACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTG 
Ifl CCTCACTCTGAGAAGAGATTTACAAGGAAGAGTGTGGTGGCTTCTCAGAGTTTCCAAGCAGGGAAACATTA 
|I| CT GGGAGGTGGACG GAGGAC AC AAT AAAAGGT GG CGCGTG GGAGT GTGCC GGGATGAT GT GGAC AGGAGG A 

AGGAGT ACGT GACT TT GTCT CCC G AT CAT GGGT ACT GGGT CCT C AG AC T G AATGGAGAAC AT T T GT ATTT C 
j~f ACATTAAATCCCCGTTTTATCAGCGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTA 
5W TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 
AAGGCTTATTGAGGCCCTACATTGAGTATCCGTCCTATAATGAGCAAAATGGAACTCCCATAGTCATCTGC 
CC AGTC ACCC AGGAAT CAGAGAAAG AGGCC TCTT GGC AAAGGGCCT CT GC AATC CCAGAGAC AAGC AACAG 
T G AGTCCT CC TC AC AGGC AACC ACGC CC TTCCTC CCC AGGGGT GAAAT G TAGG ATGAAT C AC AT CCC ACAT 
TCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCA 
GATGAAGGGGGACTGGCCTGTCCACATGGGAGTCAGGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGG 
CTGACATTACATTTAGTTTGCTCTCACTCCATCTGGCTAAGTGATCTTGAAATACCACCTCTCAGGTGAAG 
AACC GT CAGG AATTCC CAT C TCAC AGGC T GTGGT GT AGAT TAAGT AGAC AAGG AAT GT GAAT AAT GCT TAG 
AT CT T ATT GATGAC AG AG T G T ATCCT AAT G GT T T GT TCAT TAT ATT AC ACTT T C AGT AAAAAAA 
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a 

m 



ill 

m 



MMKLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEVRFFRGQFSSVVH 
LYRDGKDQPFMQMPQYQGRTKLVKDSIAEGRISLRLENITVLDAGLYGCRISSQSYYQKAIWELQ 
VSALGS VPL I S I TGYVDRDI QLLCQS S GW FPRP TAKWKGPQGQDLS TDS RTNRDMHGL FDVE I SL 
TVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQSFQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLSPDHGYWVLRLNGEHLYFT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
PIVICPVTQESEKEASWQRASAIPETSNSESSSQATTPFLPRGEM 

Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 239-255 
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FIGURE 8* 



AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGACATGCTGCTGCTGCTGCTGCCCCT 
GCTCTGGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAACTGCTGACGATGCAGAGTTCCGTGA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
GGGACCCAC AT AC CAAGAAT TGCACCCTGAGCAT CAGAGAT GCCAGAAGAAGTGAT GCGGGGAGA 
T AC T T C T T T CG TAT GGAGAAAGGAAGT ATAAAAT GGAAT T AT AAAC AT C ACC GGC T C T C T G T GAA 
TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 
GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 
GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 
AC AAGACC G T C C AT CT CAAC GT GT C CT ACCCGCC TCAGAAC T TGAC CAT GAC T G T C T TC CAAGGA 
GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 
GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 
GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 
CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 
GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAG 
CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 
TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 
AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGCCT T GGGACTCGCGGGGACAGGAGGCCAC TGACACCGAGTAC TCGGAGAT CAAGATC CACAG 
ATGAGAAACTGCAGAGACTCACCCTGATTGAGGGATCACAGCCCCTCCAGGCAAGGGAGAAGTCA 
GAGGCT GAT T C T TGTAGAAT TAACAGCCCTCAACG TGATGAGC TATGATAACAC TATGAAT TATG 
TGCAGAGTGAAAAGCACACAGGCTTTAGAGTCAAAGTATCTCAAACCTGAATCCACACTGTGCCC 
T C C C T T T TAT T T T T T T AAC T AAAAG AC AG AC AAAT T C C T A 



APP ID=1 0063735 



Page 239 of 322 



FIGURE 86 



MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPWHGYWFREG 
ANTDQDAPVATNNPARAVWEETRDRFHLLGDPHTKNCTLS IRDARRSDAGRYFFRMEKGS IKWNY 
KHHRLSVKVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVSLQSKATSGVTQGWGGAGATALVFLSFCVIFWVRSCRKKSARPAAGVGDTGIE 
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARS SVGEGELQYASLS FQMVKPWDSRGQEATDTE 
YSEIKIHR 

Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 351-370 
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\ y 



AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 
CCAAGGAAAGTGCAGC TGAGACT CAGACAAGAT TACAATGAACCAACT CAGCT TCC TGC TGT T T C 
TCATAGCGACCACCAGAGGATGGAGTACAGATGAGGCTAATACTTACTTCAAGGAATGGACCTGT 
TCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCCTAG.TGCATTTGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 
CAACTACAACACCTTTGGATCTGCAGAGGCGGCCACGAGCGATGACTACAAGAACCCTGGGTACT 
ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 
AGAAACAGCTCCCTGCTGAGGTACCGCACGGACACTGGCTTCCTCCAGACACTGGGACATAATCT 
GT TT GGCATCTACCAGAAAT AT CC AG T GAAATAT GGAGAAGGAAAGTGT TGGAC TGACAACGGC C 
CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 
GGCCAGCGGGAATTCACTGCGGGATTTGTTCAGTTCAGGGTATTTAATAACGAGAGAGCAGCCAA 
CGCCTTGTGTGGTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 
GATACTTTCCAGAGGCCAGTCCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATAT 
GGAACTCATGTTGGTTACAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCG 
T TGAG AG T T T TGTGGGAGGGAAC C CAGACCT C TC C TCCCAAC CAT GAGAT CCCAAGGATGGAGAA 
CAACTTACCCAGTAGCTAGAATGTTAATGGCAGAAGAGAAAACAATAAATCATATTGACTCAAGA 
AAAAAA 
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MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFGSAEAAT 
SDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGIYQKYPVKYG 
EGKCWTDNGPVIPWYDFGDAQKTASYYSPYGQREFTAGFVQFRVFNNERAANALCAGMRVTGCN 
TEHHCIGGGGYFPEASPQQCGDFSGFDWSGYGTHVGYSSSREITEAAVLLFYR 

Important features: 
Signal peptide: 

amino acids 1-16 

N-glycosylation site. 

amino acids 163-167 

Glycosaminoglycan attachment sites. 

amino acids 74-78, 289-293 

N-myristoylation sites. 

amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
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CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCC ATGG GCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CT CCT GGCGCATC TGGTGGTCGT CATCACCTTAT TC TGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACCCTTCTGATTACCTTCA 
TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 
GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 
T TAAGACTTATATACAGTT TTAGGGGACAATTAAAAAAAAAAAA 
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MGRVSGLVPSRFLTLIAHLVWITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 
VELAGFLSGVSMFNS TQSL I S I GAHCSASVALS FFI FERWECTTYWY I FVFCSALPAVTEMALFV 
TVFGLKKKPF 

Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 
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C T GGGACCCC GAAAAGAGAAGGGGAGAGC GAGGGGAC GAGAGCGGAGGAGGAAGATGCAAC TGAC 
TCGCTGCTGCTTCGTGTTGCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCC T CAGC CAAGGT GAAGAAAAT CTTTGGCT GGGGCGAC T T C TACT C CAAC AT C AAGAC GG T G 
GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
CCAACACAAT GCCACAGGCCAGGGAAACAT C TCCATCAGCC TCGT GCCC CCCAGTAAAGC TGTAG 
AGT TCCACCAGGAACAGCAGATCT T CAT CGAAGC C AAGGCC TCCAAAAT C T T CAAC T GCCGGATG 
GAGTGGGAGAAGGTAGAACGGGGCCGCGGGACCTCGCTTTGCACCCACGACCCAGCCAAGATCTG 
CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 
TCTACATCGCCTTCTACAGCACGGACTATCGGCTGGTCCAGAAGGTGTGCCCAGATTACAACTAC 
CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 
TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 
AG G CAG G G AG GGGGGT GGAGACG AG G AGAT G CCAAG T GG GGCC AGG GC CAAG T C TC AAG T GGCAG 
AGAAAGGGTCCCAAGTGCTGGTCCCAACCTGAAGCTGTGGAGTGACTAGATCACAGGAGCACTGG 
AGGAGGAGTGGGCTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG 
TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 
CCTTGGTTCT T GC C AT C C TGAG GAAAGATAG CAACAGGGAGGG GGAGAT T T CAT CAG T G T GGAC A 
GC CT GT CAACT TAGGAT GGAT GGC T GAGAG GGC T T CC T AGGAGCCAGT CAGC AGGGT GGGG TGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACACCCAT CC T TAAGC TAAGAC AGGACGAT TG TGGT CCT CC CACACTAAGGCCACAGC C CAT C 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
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FIGURE Q2 

MQLTRCCFVFLVQGSLYLVlCGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMftNSTL 
LGLLAPPGEAWG I LGQP PNRPNHS PP PSAKVKK I FGWGDFYS NI KTVALNLLVT GK I VDHGNGT F 
SVHFQHNATGQGNISISLVPPSKAVEFHQEQQIFIEAKASKIFNCRMEWEKVERGRRTSLCTHDP 
AKICSRDHAQSSATWSCSQPFKWCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG 



Important features of the protein: 
Signal peptide: 

f W! H amino acids 1-14 

IM N-glycosylation sites. 

3 5 

;f;; amino acids 62-65, 127-130, 137-140, 143-146 

gjl 2-oxo acid dehydrogenases acyl transferase 

s amino acids 61-71 

y 

■3 5= 

Ujl 

;fin 
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FIGURE Q3 



CGGTGGCCATGACTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGATGGACCAACACAGAAATATCTGCTGATCTTTGGAGCGTTTGTCTCTGTCTATATCCAAGAA 
ATG TT CC GAT T TGCATAT TAT AAACT C T T AAAAftAAGC CAGTGAAG GT T T GAAGAG TATAAACC C 
AGGTGAGACAGCACCC TC TATGCGAC TGCT GGCC TATG TT T C T GG C T T GG GCT T TGGAAT CAT GA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
AAGCCTGAAACTCTGCCTGCTCTGCCAAGACAAGAACTTTCTTCTTTACAACCAGCGCTCCAGAT 
AACCTCAGGGAAC CAGCACT TCC CAAACCGCAGAC TACAT C T TT AGAGGAAGCACAACT GTGCCT 
T T TT CT GAAAAT C CC T T T T TC T GGTGGAATT GAGAAAGAAATAAAACTAT GCAGATA 
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FIGURE Q4 

MTAAVFFGCAFIAFGPALALYVFTIAIEPLRI I FLIAGAFFWLVSLLISSLWFMARVTIDNKDG 
PTQKYLLIFGAFVSVYIQEMFRFAYYKLLKKASEGLKSINPGETAPSMRLLAYVSGLGFGIMSGV 
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFKTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQTFISSYYGINLASAFIILVLMGTWAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

|f| Transmembrane domains: 

UJ amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site. 

, g amino acids 120-123 

Ci 

Ml Sodium: neurotransmitter symporter family protein 

5 s .? amino acids 31-65 
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FIGURE Q5 



AAT T T T TCACC AGAG TAAAC T T GAGAAACCAAC T GGAC CT TGAG TAT T GT ACAT TTTGCCTCGTG 
GACCCAAAGGTAGCAATCTGAAACATGAGGAGTACGAT TC TACTGTT TT GTC TTCTAGGATCAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACACTACCAAACCAACAGCAGTCAAATCAGGT CTT TCC TTCT T TAAGTCT GATACCATTAACA 
CAGAT GC T C ACACT GGGGCCAGAT C T GCAT C TGT TAAATCC T GC T GCAGGAATGACACC T GG TAC 
CCAGACCCACCCATTGACCCTGGGAGGGTTGAATGTACAACAGCAACTGCACCCACATGTGTTAG 
CAATTTTTGTCACACAACTTGGAGCCCAGGGCACTATCCTAAGCTCAGAGGAATTGCCACAAATC 
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
TAATCCAGATGTCCAGGATGGAAGCCTTCCAGCAGGAGGAGCAGGTGTAAATCCTGCCACCCAGG 
GAACCCCAGCAGGCCGCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT 
GCAGGCATCCAAAGGAGCACACATGCCATCGAGGAAGCCACCACAGAATCAGCAAATGGAATTCA 
G TAAGC T G T T T CAAAT T T T T T CAAC TAAGC T GCC TCGAAT T TGG T GAT ACAT GTGAATC T T TAT C 
AT T GAT TAT AT T AT G G AAT AGAT T GAG AC AC AT T G GAT AG T C T T AG AAG AAAT T AAT T C T T AAT T 
T ACC T G AAAAT AT TC T T GAAAT T TC AGAAAATATGT TC TAT GT AGAGAAT CC CAAC TT T T AAAAA 
CAATAATTCAATGGATAAATCTGTCTTTGAAATATAACATTATGCTGCCTGGATGATATGCATAT 
T AAAACAT ATT TGGAAAAC TGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE Q6 



h 
D 
O 
CI! 
W 
SI 

w 
III 

0 

III 
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CI! 
CI 

ru 



MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
LPQIFTSLIIHSLFPGGILPTSQAGANPDVQDGSLPAGGAGVNPATQGTPAGRLPTPSG 
TDDDFAVTTPAGIQRSTHAIEEATTESANGIQ 

Signal peptide: 

amino acids 1-16 
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FIGURE Q7 



GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTAC C AGC TACAGGT CACCC T G GAGATGCAGGATGGACAT GT C T TGT GGGGTCCAC 
AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
GGATGAGCCAGGCACAGCCAACTCGGATCTTCGATTCCACATCCTGAGCCAGGCTCCAGCCCAGC 
p* CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
Q ACCAGCCTTGACCACGCCCTGGAGAGGACCTACCAGCTGTTGGTACAGGTCAAGGACATGGGTGA 
P% CCAGGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT 
CCCTAGAGCCTATCCACCTGGCAGAGAATCTCAAAGTCCTATACCCGCACCACATGGCCCAGGTA 
M J CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 
Ul TGCAGAGGGAAACCTCTACGTGACCAGAGAGCTGGACAGAGAAGCCCAGGCTGAGTACCTGCTCC 
%J AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
s 8 I ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 
?2 CAGTCCACCAGGTACT GAAGTGAC TAGAC TG T CAGCAGAGGATGCAGAT GCCCCCGGC T CCCCCA 

^ ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 
* CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 
P CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
m TCGAAGTCGCAGTCACAGATATCAATGATCACGCCCCTGAGTTCATCACTTCCCAGATTGGGCCT 
J:| ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
y CCTCGAGCCCGCC T TCCGCCTCATGGATT TT GCCATTGAGAGGGGAGACACAGAAGGGAC T TTTG 

Pi GCCTGGATTGGGAGCCAGACTCTGGGCATGTTAGACTCAGACTCTGCAAGAACCTCAGTTATGAG 
f*| GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
i?3 « AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 
ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 
CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCGAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCTCTGGACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
AT T T TCACCCAC T GGACCAT GT CAAGGAAGAAGGAC CC GGAT CAACCAGCAGACAGCG TGC CCCT 
GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
TCCCCTGGGAGAGAGCCCAGCACCCAAGATCCAGCAGGGGACAGGACAGAGTAGAAGCCCCTCCA 
TCTGCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCCTGGACACCAACTT 
TAT GGAC T GC CCAT GGGAGTGCTCCAAAT GT CAGGGTGT T TGC CCAATAATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE Q8 

MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 
STSLDHALERT YQLLVQVKDMGDQAS GHQATATVE VS I IES TWS LEP I HLAENLKVL YPHHKAQ 
VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPTSGSVTLGVLPLRAGQNILLLVLAMDLAGAEGGFSSTCEVEVAVTDINDHAPEFITSQIG 
PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
EAAPSHEWVWQSVAKLVGPGPGPGATATVTVLVERVMPPPKLDQESYEASVPISAPAGSFLLT 
IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 
QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
REHI IPVWSHNAQmQLLVRVIVCRC]Sr^GQCMRKVGRMKGMPTKLSAVGILVGTLVAIGI FLI 
LI FTHWTMSRKKDPDQPADS VPLKATV 

Signal peptide: 

amino acids 1-18 

Transmembrane domain: 

amino acids 7 62-784 
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FIGURE qq 



GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AG T C CAAG AT T C T T C C C AG GAAC AC AAAC G T AG GAGAC CCACGC T C C T G G AAGCACC AGC C T T T A 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
ATCTAGGACCCAGGCATCTTGCTTTCCAGCCACAAAGAGACA GATGA AGATGCAGAAAGGAAATG 
T T C T C C T TAT GTTTGGTC T AC TAT T G C AT T T AGAAGC T GC AAC AAAT T C C AAT GAGAC TAG C AC C 
TCTGCCAACACTGGATCCAGTGTGATCTCCAGTGGAGCCAGCACAGCCACCAACTCTGGGTCCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCATAG T CACCAACT C T GAGT TC CATACAACC T CCAGTGGGAT CAGCACAGC C AC CAACTCT GAG 
T T C AGCACAGCG T CCAGTGGGAT CAGCATAGCCACCAACTC TGAGTCCAGCACAACCTCCAGT GG 
GGCCAGCACAGCCACCAACTCTGAGTCCAGCACACCCTCCAGTGGGGCCAGCACAGTCACCAACT 
CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
AGTAGGGC CAGCACTGCCACCAAC TC TGAGT C TAGCACAC T CT C CAGT GGGGCCAGCACAGCCAC 
CAACTCTGACTCCAGCACAACCTCCAGTGGGGCTAGCACAGCCACCAACTCTGAGTCCAGCACAA 
CCT CCAGT GGGGCCAGCACAGCCAC CAACT C TGAGTCCAGCACAGTGT C CAG TAGGGCCAGCACT 
GC CAC CAAC T CTGAGT CCAGCACAACC T CCAGTGGGGCCAGCACAGCCACCAAC T C TGAG T CCAG 
AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
GCACAGCCACCAACTCTGACTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAG 
TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG 
GGCTAGCACAGCCACCAACTCTGACTCCAGCACAACCTCCAGTGGGGCCGGCACAGCCACCAACT 
CT GAGTCCAGCACAGT G TCCAGT GGGATCAGCACAGT CACCAAT T CT GAGTCCAGCAC AC CCT CC 
AGTGGGGCCAACACAGCCACCAACTCTGAGTCCAGTACGACCTCCAGTGGGGCCAACACAGCCAC 
CAAC T C T GAG T C C AGC ACAG T G T C CAG T G G G GC C AGC AC T GC C AC CAAC T C T GAG T C CAGC ACAA 
CCTCCAGTGGGGTCAGCACAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCTAGCACA 
GCCACCAACTCTGACTCCAGCACAACCTCCAGTGAGGCCAGCACAGCCACCAACTCTGAGTCTAG 
CACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACAACCTCCAGTGGGGCCA 
ACACAGCCACCAAC TC TGGG TCCAGT GT GAC C TC TGCAGGC T CT GGAACAGCAGC TC TGAC TGGA 
ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 
GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
TC T TC T TCT GTGT GAGAAACAGCC T G TC CCT GAGAAACACC T TT AACACAGC T GT C TAC CACC C T 
CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
GT GGAGT CC TAAC T GG T TCT GGAG GAGAC CAG TAT CAT CGATAGCCAT GGAGATGAGCGGGAGGA 
ACAGCGGGCCCTGAGCAGCCCCGGAAGCAAGTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AATCTTGAAGAAGGTATTCCTCACCTTTCTTGCCTTTACCAGACACTGGAAAGAGAATACTATAT 
T G C T CAT T T AGC TAAG AAAT AAAT AC AT C T CAT C T AAC ACAC AC G AC AAAG AG AAG CTGTGCTTG 
CCCCGGGGTGGGTATCTAGCTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTC 
CATCTGGCATTCAAAATCTCCACAGTAAAATCCAAAGACCTCAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 100 

MKMQKGNVLLMFGLLLHLEAATNS3STETSTSANTGSSVISSGASTATNSGSSVTSSGVSTATISGS 
SVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GASTVTNSGSSVTSSGASTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSSTTSSGASTA 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAG TATNSE S S TVS S G I S T VTNSE S S T P S S GANT ATNS E S S T T S S GANT ATNS ESS TVS S GAS TA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWEIFLITLVSWA 
AVGLFAGLFFCVRNSLSLRNTFNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWSPNWFWRRPVSSI 
AMEMSGRNSGP 

Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 510-532 
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FIGURE 101 



Ml 



GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCTGGAACTATCAGGCTGAAAAACAGAGTGGGTACTCTCTTCTGGGAAGCTGGCAACAAAT 
GGATGATGTGATATATGCATTCCAGGGGAAGGGAAATTGTGGTGCTTCTGAACCCATGGTCAATT 
AACGAGGCAGTTTCTAGCTACTGCACGTACTTCATAAAGCAGGACTCTAAAAGCTTTGGAATCAT 
GGTGTCATGGAAAGGGATTTACTTTATACTGACTCTGTTTTGGGGAAGCTTTTTTGGAAGCATTT 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
!!! CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 
GATTATAAC T GGGGAT GCAT T TGT T CCTGGAGAAAGAAGT G T CAT TAT CAT GAACCAT CGGACAA 
g| GAATGGACTGGATGTTCCTGTGGAATTGCCTGATGCGATATAGCTACCTCAGATTGGAGAAAATT 
Uj TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 
y\ C T TCAT T C AT AGGAAATGGAAGGATGACAAGAGCCAT T TCGAAGACAT GAT TGAT TAC T T TTGTG 

ATAT T CACGAACCAC T TCAAC T CCTCATAT T CCCAGAAGGGAC TGAT C T CACAGAAAACAGCAAG 
T C T C GAAG T AAT GC AT T T GC T G AAAAAAAT G GAC T T C AGAAAT AT GAAT AT G T T T TAC AT C C AAG 

5= 

p AACTACAGGCTTTACTTTTGTGGTAGACCGTCTAAGAGAAGGTAAGAACCTTGATGCTGTCCATG 
Hi ATATCACTGTGGCGTATCCTCACAACATTCCTCAATCAGAGAAGCACCTCCTCCAAGGAGACTTT 
CC CAGGGAAATCCACTT TCACGT CCACCGGTATCCAATAGACACC C TC CC CACAT CCAAGGAGGA 
|J{ . CCTTCAACTCTGGTGCCACAAACGGTGGGAAGAGAAAGAAGAGAGGCTGCGTTCCTTCTATCAAG 
fj{ GGGAGAAGAAT T T T TAT T T TACC GGACAGAGTGTCAT T CCAC CT T GCAAG TC TGAAC TCAGGGTC 

CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATAT T T GTACAGT C TT G T TAAGT GGTAT T T T AT AATCACCAT TG T AAT CTTTGTGCT GCAAGAGA 
GAAT ATT TGGT GGACTGGAGAT CAT AGAACT TGCATGT TACC GAC T T T TACACAAACAGCCACAT 
T TAAAT TCAAAGAAAAAT GAG TAA GAT TATAAGG T T TGCCAT GT GAAAAC C T AGAGCATAT T T TG 
GAAATGTTCTAAACCTTTCTAAGCTCAGATGCATTTTTGCATGACTATGTCGAATATTTCTTACT 
GCCATCATTATTTGTTAAAGATATTTTGCACTTAATTTTGTGGGAAAAATATTGCTACAATTTTT 
TTTAATCTCTGAATGTAATTTCGATACTGTGTACATAGCAGGGAGTGATCGGGGTGAAATAACTT 
GGGCCAGAATAT TAT T AAAC AAT CAT CAGGCT TT TAAA 
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FIGURE 102 

MHSRGRE I WLLNPWS INEAVS S YCT YFI KQDS KS FG IMVS WKG I YFI LT L FWGS FFGS I FMLS P 
FLPLMFVNPSWYRWINNRLVATWLTLPVALLETMFGVKVI I TGDAFVPGERSVI IMNHRTRMDWM 
FLWNCLMRYSYLRLEKICLKASLKGVPGFGWAMQAAAYIFIHRKWKDDKSHFEDMIDYFCDIHEP 
LQLLI FPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDI TVA 
YPHNI PQSEKHLLQGDFPRE IHFHVHRYPI DTLPTSKEDLQLWCHKRWEEKEERLRS FYQGEKNF 
YFTGQSVIPPCKSELRVLWKLLSILYWTLFSPAMCLLIYLYSLVKWYFIITIVIFVLQERIFGG 
LE I IELACYRLLHKQPHLNS KKNE 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-108, 354-377 
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FIGURE 103 



CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
TCCAAATCATCCATCCACCCCTGCTGTCATCTGTTTTCATAGTGTGAGATCAACCCACAGGAATA 
TCC ATG GCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CACCTCTACAGAGATGGGGAAGACTGGGAATCTAAGCAGATGCCACAGTATCGAGGGAGAACTGA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 
AGGATTTGTCTTCAGACTCCAGAGCAAATGCAGATGGGTACAGCCTGTATGATGTGGAGATCTCC 
ATTATAGTCCAGGAAAATGCTGGGAGCATATTGTGTTCCATCCACCTTGCTGAGCAGAGTCATGA 
GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
AAAT CCAAAGGGAAAAT CCAGGCGGAAC T GGACT GGAGAAGAAAGCAC GG ACAGGCAGAAT TGAG 
AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG 
TTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTGCCTCACTCTGAGAAGAGA 
TTTACAAGGAAGAGTGTGGTGGCTTCTCAGGGTTTCCAAGCAGGGAGACATTACTGGGAGGTGGA 
CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA 
ATGTGACTTTGTCTCCCAACAATGGGTATTGGGTCCTCAGACTGACAACAGAACATTTGTATTTC 
ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 
T GACAT GTCAGT T TGAAGGC TTGT T GAGACCCTATATCCAGCATGCGAT GTAT GACGAGGAAAAG 
GGGACT CCCATAT TCATATG TCCAGT GT CCT GGGGATGAGACAGAGAAGACCC TGCT TAAAGGGC 
CCCACACCACAGACCCAGACACAGCCAAGGGAGAGTGCTCCCGACAGGTGGCCCCAGCTTCCTCT 
CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 
TCAGAAGCCATGGC TGCCCT GAAGT GGGGACGGAATAGAC T CACAT T AGG T TTAGT T TGTGAAAA 
CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 
ACAGTGAT TCC TGCC TCACAGGTGAAGAT TAAAGAGACAACGAAT GT GAATCATGC T TGCAGGTT 
TGAGGGCACAGTGTTTGCTAATGATGTGTTTTTATATTATACATTTTCCCACCATAAACTCTGTT 
TGCT TAT T C CACAT T AAT T T AC T T T T C T C TAT AC CAAAT C AC C CAT G G AAT AG T TAT T GAAC AC C 
TGC TT T GTGAGGC TCAAAGAATAAAGAGGAGGTAGGAT T TT T CACT GAT T CTATAAGCCCAGCAT 
TAC CT GATAC CAAAACCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAAC T ACAGG T CCATATCC 
C T CAT T AAC AC AGACACAAAAAT T C TAAAT AAAAT T T T AACAAAT T AAAC T AAAC AAT AT AT T T A 
AAGAT GATATATAAC TAC TCAGT GTGGTTTGTCC CACAAATGCAGAGT T GGT T TAATAT T TAAAT 
AT CAAC CAGT GTAAT TCAGCACATTAATAAAGTAAAAAAGAAAACCAT AAAAAAAAAAAAAAA 
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FIGURE too. 

MAFVLILVLSFYELVSGQWQVTGPGKFVQALVGEDAVFSCSLFPETSAEAMEVRFFRNQFHAWH 
LYRDGEDWESKQMPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR 
VAALGSLPLISIVGYVDGGIQLLCLSSGWFPQPTAKWKGPQGQDLSSDSRANADGYSLYDVEISI 
IVQENAGSILCSIHLAEQSHEVESKVLIGETFFQPSPWRLASILLGLLCGALCGWMGMIIVFFK 
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQGFQAGRHYWEVDVGQNVGWYVGVCRDDVDRGKNNVTLSPNNGYWVLRLTTEHLYFT 
FNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQSLIYTLLTCQFEGLLRPYIQHAMYDEEKG 
TPIFICPVSWG 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 
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FIGURE 105 



CCTTCACAGGACTCTTCATTGCTGGTTGGC AATGA TGTATCGGCCAGATGTGGTGAGGGCTAGGAAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCT'TCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CT CACTGT T C AT T ATGT GAGAT AT AAT C AAAAGAAG AC CT AC AAT T ACT AT AGC AC AT T G TCAT TT AC AAC 
T G AC AAACT AT ATGCT GAGT TT GGC AGAGAGGCT TC T AAC AATTT T AC AG AAAT GAGCC AGAGACT T G AAT 
C AAT GGTG AAAAAT GC AT T T TAT AAAT CT C C AT T AAGGGAAG AAT T TGT C AAGT CT CAGGTT AT C AAGT TC 
AGTC AACAGAAGCAT GGAGT GT TGGC TC AT AT G CT GTTG ATT TGT AGAT T TC AC TCT ACT GAGGAT CCTGA 
AACT GT AGAT AAAAT T GT TC AACT T GTT T T AC AT GAAAAG CT GCAAGATGCT GT AGGACCC CCT AAAGT AG 
jL! ? ATCCTCACTCAGTTAAAATTAAAAAAATCAACAAGACAGAAACAGACAGCTATCTAAACCATTGCTGCGGA 
Q AC ACGAAG AAGT AAAACT CT AGGT CAGAGT CT C AGG AT CGTT G GTGGG AC AG AAGT AGAAGAGGGT GAAT G 

Q GCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGTGGAGCAACCTTAATTAATGCCACATGGC 
IP 1 TTGTGAGTGCTGCTCACTGTTTTACAACATATAAGAACCCTGCCAGATGGACTGCTTCCTTTGGAGTAACA 
AT AAAACC TT CGAAAAT G AAACGGGGTCTCCGGAG AAT AATT GT C CAT G AAAAAT ACAAACACCC AT C AC A 
!: 1 T G ACT ATGAT ATTT CT CTTGCAGAGC TT TCT AGC CC T GT T CC CT AC AC AAAT GC AGT ACAT AGAGT T TGT C 

k ;j : 

ill T C CCTGAT GC AT CCT ATGAGTT TC AACC AG GT GATGT G AT GT T T GT GACAGG AT TTGG AGCACT GAAAAAT 

GATGGTT ACAGTCAAAATCATCTTCGACAAGCACAGGTGACT CT C AT AGACGCT ACAACT TGCAAT GAAC C 

W 

r-| T C AAGCTT AC AATGACGCCATAACT C CT AG AAT GT T AT GT GCTGGCT CCT T AGAAGGAAAAACAGATGCAT 

If! GCCAGGGTGACTCTGGAGGACCACTGGTTAGTTCAGATGCTAGAGATATCTGGTACCTTGCTGGAATAGTG 
O AGCTGGGGAGATGAATGTGCGAAACCCAACAAGCCTGGTGTTTATACTAGAGTTACGGCCTTGCGGGACTG 
EI| GATTACTTCAAAAACTGGTATC TAA GAGACAAAAGCCTCATGGAACAGATAACATTTTTTTTTGTTTTTTG 
GGTGT GGAGGCC AT TTTT AG AGAT AC AG AATT GG AG AAG ACT T G CAAAAC AGCT AGAT TT GACT GAT CTC A 

rs? 

ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 
GATC AAC T CT GTC ATC TGTGAGC AAT AGT T GAAACT TT AT GT AC AT AG AG AAAT AG AT AAT AC AAT AT T AC 
ATTACAGCCTGTATTCATTTGTTCTCTAGAAGTTTTGTCAGAATTTTGACTTGTTGACATAAATTTGTAAT 
GCAT AT AT ACAATTTGAAGCACTCCTTT TCTT CAGTTCCTCAGCTCCTCTCATT TC AGC AAAT ATCC ATT T 
TCAAGGTGCAGAACAAGGAGTGAAAGAAAATATAAGAAGAAAAAAATCCCCTACATTTTATTGGCACAGAA 
AAGT ATTAGGTGTTTTTCTTAGTGGAATATTAGAAATGAT CAT ATT CATTAT GAAAGGTC AAGC AAAGACA 
GCAGAATACCAATCACTTCATCATTTAGGAAGTATGGGAACTAAGTTAAGGAAGTCCAGAAAGAAGCCAAG 
ATATATCCTTATTTTCATTTCCAAACAACTACTATGATAAATGTGAAGAAGATTCTGTTTTTTTGTGACCT 
ATAATAATTATACAAACTTCATGCAATGTACTTGTTCTAAGCAAATTAAAGCAAATATTTATTTAACATTG 
T TACT GAGGATGTCAACATATAAC AAT AAAAT AT AAAT C ACCC A 
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FIGURE 106 

MMYRPDWRARKRVCWE PWVI GLVIFI S L I VLAVC I GL T VHYVRYNQKKT YNY YS T LS FT TDKL Y 
AEFGREASNNFTEMSQRLESMVKNAFYKSPLREEFVKSQVIKFSQQKHGVLAHMLLICRFHSTED 
PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYPCNPARWTASFGVTIKPSKMKRGL 
RRIIVHEKYKHPSHDYDISLAELSSPVPYTNAVHRVCLPDASYEFQPGDVMFVTGFGALKNDGYS 
QNHLRQAQVTLIDATTCNEPQAYNDAITPRMLCAGSLEGKTDACQGDSGGPLVSSDARDIWYLAG 
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

f*\ Transmembrane domain: 

Q) amino acids 21-40 (type II) 

w 

If! 
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FIGURE 107 



AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
AT AATG TT CT CAC TT TAAC T GCAGAAAC GAGGGTAGAGGAAGCAG TC ATTT T GAC T TAC T T TCCT 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
GTGTAGAACTGGCTTGTGGCGTTTGGACATATGAACAGGAACTTATGGTTCCAGTACAATGGTCA 
GATAT GGTCACT T TGAAAGCCAGGATGACAAATTATGGATTACCTAGATATCGGTGGCT tactca 
[** T GC T T GGAAT TT T T T T CAGAGAGAGT T TAAGTGC T GTGGAG TAG TAT AT T T CACT GAC T GGT TGG 

CI AAATGACAGAGATGGACTGGCCCCCAGATTCCTGCTGTGTTAGAGAATTCCCAGGATGTTCCAAA 
O CAGGCCCACCAGGAAGATCTCAGTGACCTTTATCAAGAGGGTTGTGGGAAGAAAATGTATTCCTT 
ffj T TT GAGAGGAACCAAACAAC TGCAGGTGCT GAGGT TT C TGGGAAT CT CCAT T GGGGTGACACAAA 

T I TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
f'* ACAGACCAAATGATGTCCTTGAAGAATGACAACTCTCAGCACCTGTCATGTCCCTCAGTAGAACT 
yi GT TGAAAC CAAGCCTG T CAAGAAT CT T T GAACACACAT CCAT GGCAAACAGCT TTAATACACACT 

yj T T GAGATGGAGGAGT T ATAAA AAGAAAT GT CACAGAAGAAAACCACAAACTT GT T T TAT T GGACT 

|J1 TGTGAAT TT T TGAGTACATAC TAT GT GTT TCAGAAATATGTAGAAATAAAAAT GT TGCCATAAAA 

TAAC AC C T AAGC ATAT AC TAT T C TAT GC T T T AAAATGAGGAT GG AAAAG T T T CAT G TC AT AAG T C 
ACCACCTGGACAATAATTGATGCCCTTAAAATGCTGAAGACAGATGTCATACCCACTGTGTAGCC 
W TGT GT AT GAC TT T TAC T GAACACAGT TAT GT T TT GAGGCAGCAT GGTT T GAT TAGCAT T T CC GCA 

111 T CCAT GCAAACGAG TCACATATGGT GGGAC TGGAGCCAT AGTAAAGG T T GAT T TAC T TC TACCAA 

p C TAGT ATAT AAAG TAC TAAT TAAAT GCT AAC ATAGGAAG T T AGAAAAT ACT AAT AAC T T T TAT TA 

X C T C AG C GAT C TAT T C T T C T GAT G C T AAAT AAAT T ATAT AT C AG AAAAC T T T C AAT AT T G G T GAC T 

ACCTAAATGTGATTTTTGCTGGTTACTAAAATATTCTTACCACTTAAAAGAGCAAGCTAACACAT 
y TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAATCTGTATAATTCAGTCGAT 
f|j T T C AG T T C T GAT AAT G T T AAGAAT AAC CAT TAT GAAAAG GAAAAT TTGTCCTG TAT AG CAT CAT T 

ATTTTTAGCCTTTCCTGTTAATAAAGCTTTACTATTCTGTCCTGGGCTTATATTACACATATAAC 
T G T T AT T TAAAT AC T TAAC CAC TAAT T T T GAAAAT TAC C AG TGT GAT AC AT AG GAAT CAT T AT T C 
AGAAT G T AG TC TGGTC T T TAGGAAGTAT TAATAAGAAAAT TT GCACAT AACT TAGT T GAT TCAGA 
AAGGAC T TGTATGC TG T T T T TCT CCCAAATGAAGAC TC T T T T T GAC AC T AAAC AC T T T T TAAAAA 
GCT TAT CTT TGCC T TC TCCAAACAAGAAGCAATAG TCT CCAAGTCAATAT AAAT TC TACAGAAAA 
TAG TGTTCTTTTTCTC C AGAAAAAT G C T T G T GAGAAT C AT T AAAAC AT G T GAC AAT T TAG AG AT T 
CTTTGTTT TAT T T CAC T GAT T AATATAC TGT GGC AAAT TAC AC AG AT TAT TAAAT T T T T T TAC AA 
GAGTATAGTATATTTATTTGAAATGGGAAAAGTGCATTTTACTGTATTTTGTGTATTTTGTTTAT 
T TC TCAGAAT AT GGAAAGAAAAT TAAAATGTGTCAATAAAT AT TT T C T AGAGAG TAA 
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FIGURE 108 

MAREDSVKCLRCLLYALNLLFWIiMS I SVLJW'SAWRDYLNNVLTLTAETRVEEAVILT YFPVVHP 
VMIAVCCFLIIVGMLGYCGTWRNLLLIAWYFGSLL^^ 

LKARMTNYGLPRYRWLTHAWNFFQRE FKCCGWYFTDWLEMTEMDWPPDSCCVREFPGCSKQAHQ 
EDLSDLYQEGCGKKMYSFLRGTKQLQVLRFLGISIGVTQILAMILTITLLWALYYDRREPGTDQM 
MSLKNDNSQHLSCPSVELLKPSLSRIFEHTSMANSFNTHFEMEEL 

Signal peptide; 

amino acids 1-33 

Transmembrane domains: 

amino acids 12-35, 57-86, 94-114, 226-248 
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FIGURE 1QQ 



CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GATAAAGTCTTTCTCGTGAGAGCCTAGAGGCCTTAAAAAAAAAAGTGCTTGAAAGAGAAGGGGACAAAGGAACA 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGG ATGC CTCCATTCCTGCTTCTCACCTG 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
CCTGGAGGAACACTGACCACCAGTTGGATGAGTCTCAAGGTCCTCCTCTATGTGACAACCATGT GAATGGGGAG 
T GGT ACC ACTT C ACGGGC AT GGC GGGAGAT GCCAT GCCT ACCTTCT GC AT AC C AGAAAACC ACT GT GGAACC CA 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
t . TCAATGGGAACTGCTGTCTCTGGAACACCACGGTGGAAGTCAAGGCTTGCCCTGGAGGCTACTATGTGTATCGT 
W CTGACCAAGCCCAGCGTCTGCTTCCACGTCTACTGTGGTCATTTTTATGACATCTGCGACGAGGACTGCCATGG 
?*\ C AGCT GC T C AG AT AC C AGC GAGT GCAC AT GC GCT C C AGGAAC T GT GCT AGGCCCT GAC AGGCAGACAT GCTT TG 

iff* AT G AAAAT GAAT GT G AGC AAAAC AAC GGT GGCT GC AGT GAGAT C T GT GT GAACC T C AAAAAC T C CT AC C GCT GT 

US' 5 

fy GAGTGTGGGGTTGGCCGTGTGCTAAGAAGTGATGGCAAGACTTGTGAAGACGTTGAAGGATGCCACAATAACAA 
SI TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 
|ll CTGAGGATAACCACACTTGCCAAGTCCCTGTGTTGTGCAAATCAAATGCCATTGAAGTGAACATCCCCAGGGAG 
Iff CTGGTTGGTGGCCTGGAGCTCTTCCTGACCAACACCTCCTGCCGAGGAGTGTCCAACGGCACCCATGTCAACAT 
S C CT C T T C TCT CT C AAGAC AT GT GGT AC AGT G GTC GAT GT GGT GAAT G ACAAGAT T G T GGC C AGC AAC C TC GT GA 

B CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 
Ul ACCTGCGAGTTTCCACGCCTGTACACCATTTCTGAAGGATACGTTCCCAACCTTCGAAACTCCCCACTGGAAAT 
*al CATGAGCCGAAATCATGGGATCTTCCCATTCACTCTGGAGATCTTCAAGGACAATGAGTTTGAAGAGCCTTACC 
W GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 
TTGGAAAGCTTGGTGGAGAGCTGCTTTGCCACCCCCACCTCCAAGATCGACGAGGTCCTGAAATACTACCTCAT 
2 ^ CCGGGATGGCTGTGTTTCAGATGACTCGGTAAAGCAGTACACATCCCGGGATCACCTAGCAAAGCACTTCCAGG 
TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGTGGAGTG 
TT GG AC G A GC GT T CC C GCT GT GC C CAG GGT T GCCACC GGC GAAT GC GT C GT GGGGC AGGAGGAG AGGACT C AGC 
CGGTCTACAGGGCCAGACGCTAACAGGCGGCCCGATCCGCATCGACTGGGAGGACTAGTTCGTAGCCATACCTC 
GAGTCCCTGCATTGGACGGCTCT GCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTC'TGGTCCATTCAGTGGGCA 
CAGGTCACAGCACTGCTGAACAATGTGGCCTGGGT GGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
CAC AAAAT CAGAAGC T GGGTATAATAT TTCAAGTT ACAAACCCTAGAAAAATT AAACAGT TACT GAAATT AT GA 
CTTAAATACCCAATGACTCCTTAAATATGTAAATTATAGTTATACCTTGAAATTTCAATTCAAATGCAGACTAA 
TTATAGGGAATTT GGAAGT GTAT CAATAAAACAGT ATATAATTTT 
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FIGURE no 



MPPFLLLTCLFITGTSVSPVALDPCSAYISLNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDAMP 
TFCIPENHCGTHAPVWLNGSHPLEGDGIVQRQACASFNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTWDWNDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT 
CEFPRLYTISEGYVPNLRNSPLEIMSRNHGIFPFTLEIFKDNEFEEPYREALPTLKLRDSLY FGIEPWHV 
SGLESLVESCFATPTSKIDEVLECYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHKEVFLHCRV 
LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 

Important features of the protein: 
Signal peptide: 
amino acids 1-16 

N-glycosylation sites. 

amino acids 89-93, 116-120, 259-263, 291-295, 299-303 
Tyrosine kinase phosphorylation sites. 



Q amino acids 411-418, 443-451 

pi 



N-rayristoylation sites. 

amino acids 226 
522-528, 531-537 



IW amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 



Aspartic acid and asparagine hydroxylation site. 

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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FIGURE 111 



GAGAGAGGCAGCAGCTTGCTCAGCGGACAAGGATGCTGGGCGTGAGGGACCAAGGCCTGCCCTGCACTCGG 
GCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCCT 
CCTGCTGCCTTGGGGTGACAATCTCAGCTCCAGGCTACAGGGAGACCGGGAGGATCACAGAGCCAGCATGT 
TACAGGATCCTGACAGTGATCAACCTCTGAACAGCCTCGATGTCAAACCCCTGCGCAAACCCCGTATCCCC 
AT GGAGACCT TC AGAAAGGT GGGGATCCCC ATCATC AT AGC ACT ACTGAGCCTGGCGAGTATCATCATTGT 
GGTTGTCCTCATCAAGGTGATTCTGGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGA 
GGAAGC AGCT GT GT GACGG AGAGCT GGACT GT CC CT TGGGGGAGGACGAGGAGC ACT GTGT C AAGAGCTT C 
y s CCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGGACTCGGCCAC 
fl AGGGAACTGGTTCTCTGCCTGTTTCGACAACTTCACAGAAGCTCTCGCTGAGACAGCCTGTAGGCAGATGG 
Q GCTACAGCAGAGCTGT GGAGATTGGCCCAGACCAGGATCTGGAT GT TGTTGAAATCACAGAAAACAGCCAG 

IP GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
**W TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 
TCAGCATCCAGTACGACAAACAGCACGTCTGTGGAGGGAGCATCCTGGACCCCCACTGGGTCCTCACGGCA 
GCCCACTGCTTCAGGAAACATACCGATGTGTTCAACTGGAAGGTGCGGGCAGGCTCAGACAAACTGGGCAG 

ill I 

CT TCCC AT CC C T GGCT GTGG CCAAGATCATCAT CAT TGAATTC AAC CCCATGT ACC CC AAAGAC AAT GAC A 
p»| TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 
ITi GATGAGGAGC TC ACTC CAGCCACCCC ACTCTGGAT C AT TGGAT GGGGCT T TACG AAGCAG AATGGAGGGAA 

fj GATGTC TG AC AT ACTG CTGC AG GCGT CAGTCC AGGTC AT T GACAGC AC AC GGT GC AAT GC AG AC GAT GCGT 

||| ACCAGGGGGAAGTCACCGAGAAGATGATGTGTGCAGGCATCCCGGAAGGGGGTGTGGACACCTGCCAGGGT 
Q GACAGTGGTGGGCCCCTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGG 
Rl CTGCGGGGGCCCGAGCACCCCAGGAGTATACACCAAGGTCTCAGCCTATCTCAACTGGATCTACAATGTCT 
GGAAGGCTGAGCTGTAATGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGGATCCCCCAAAGTCAGACACAGAGCAAGAGTCCCCTTGGGTACACCCCTCTGCCCACAGCCTCAGCAT 
TTCTTGGAGCAGCAAAGGGCCTCAATTCCTGTAAGAGACCCTCGCAGCCCAGAGGCGCCCAGAGGAAGTCA 
GCAGCCCTAGCTCGGCCACACTTGGTGCTCCCAGCATCCCAGGGAGAGACACAGCCCACTGAACAAGGTCT 
CAGGGGTATTGCTAAGCCAAGAAGGAACTTTCCCACACTACTGAATGGAAGCAGGCTGTCTTGTAAAAGCC 
C AGAT C ACT GT G GGCT GG AG AGGAGAAGGAAAGGGTCT GC GCCAGCCCT G TC CGT C TT C AC CCATCC C CAA 
GC CT AC TAGAGC AAGAAACC AGTTGT AAT AT AAAAT GCAC T GCC CT ACT G TT GGTATGACT ACC GT T ACC T 
AC TGTT GT C ATT GT TATT AC AGCT AT GGCC ACT AT TAT T AAAGAGCT GT GTAAC AT CT CT GG CAAAAAAAA 
AAAA 
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FIGURE 112 

MLQDPDS DQPLNS LDVKPLRKPRI PMET FRKVGI P 1 1 IALLSLAS 1 1 I VWL IKVI LDKYYFLCG 
QPLHFIPRKQLCDGELDCPLGEDEEHCVKSFPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEAIAETACRQMGYSRAVEIGPDQDLDVVEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL 
KTPRWGGEEASVDSWPWQVSIQYDKQHVCGGSILDPHWVLTAAHCFRKHTDVFNWKVRAGSDKL 
GS FPSLAVAKI 1 1 IEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWI IGWG 
FTKQNGGKMSDILLQASVQVIDSTRCNADDAYQGEVTEKMMCAGIPEGGVDTCQGDSGGPLMYQS 
DQWHWGIVSWGYGCGGPSTPGVYTKVSAYLNWIYNVWKAEL 

Transmembrane domain: 

amino acids 32-53 (typell) 
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FIGURE lis 



GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
T AGGT GT AAGC CACCGT GT CTGGCCTCT GAAC AAC T T T T T C AGCAAC T AAAAAAGC CAC AGGAG T 
TGAACTGCTAGGATTCTGACT ATG CTGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
CTTGCCCTGGCCGTAGAAGGGAT TGA CAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
p AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
M< GAAGCAGT GG GT GAGACAT CACGC TGCCC GC CCAT CTAACC T T T T CAT GT CCTGCACAT CAC C T G 

ff ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 
g "l CAGAAGGGGTCTGCTTAGACCACCTGGTTTATGTGACAGGACTTGCATTCTCCTGGAACATGAGG 
[ft GAACGCCGGAGGAAAGCAAAGTGGCAGGGAAGGAACTTGTGCCAAATTATGGGTCAGAAAAGATG 
% GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 
CI CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
;[] CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 
5S TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 
H CTCCTTCCCTCTGAGAGGCCCTCCTATGTCCCTACTAAAGCCACCAGCAAGACATAGCTGACAGG 
f|| GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 
GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 
CTCAATTTAAATCATGTTCTAGTAATTGGAGCTGTCCCCAAGACCAAAGGAGCTAGAGCTTGGTT 
CAAAT GAT C T CC AAGGGCCC TTAT AC C C CAGGAGAC T T T GAT T T GAAT T T GAAACCC CAAAT CCA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
TTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
CCAGCTAC T CGGGAGGC TGAGACAGGAGAAT TAC T TGAACC TGGGAGGT GAAGGAGGC T GAGACA 
GGAG AAT CAC T T CAGCC T GAGCAAC ACAGCGAGAC TC T GT C T CAGAAAAAAT AAAAAAAGAAT T A 
TGGTTATTTGTAA 
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FIGURE 114 

MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 

Signal peptide: 

amino acids 1-15 
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FIGURE lis 



CAGCAGTGGT CTC TCAG T.CC TCT CAAAGC AAGGAAAGAGTACT GT GTGCTGAGAGAC CATGGCAA 
AGAAT CCT CCAGAGAAT TG T GAAGACT GT CACAT T CTAAATGC AGAAGCT T T TAAATCCAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GTTTTGGGGGAGCAAGCACTTCTGGCCGGAGGTACCCAAAAAAGCCTATGACATGGAGCACACTT 
T C TACAGCAATGGAGAGAAGAAGAAGAT T T ACATGGAAAT TGAT C CTGT GACCAGAAC T GAAATA 
TTCAGAAGCGGAAATGGCACTGATGAAACATTGGAAGTGCACGACTTTAAAAACGGATACACTGG 
CATCTACTTCGTGGGTCTTG?\AAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
12 CT GAACCAGAAGAGGAAATAGAT G AGAAT GAAGAAAT TACCACAACT T TCT T TGAACAGT CAGT G 

III ATTTGGGTCCCAGCAGAAAAGCCTATTGAAAACCGAGATTTTCTTAAAAATTCCAAAATTCTGGA 
|f| GATTTGTGATAACGTGACCATGTATTGGATCAATCCCACTCTAATATCAGTTTCTGAGTTACAAG 
III ACT TTGAGGAGGAGGGAGAAGAT C TT CACT TTCC TGCCAACGAAAAAAAAGGGATTGAACAAAAT 

x l GAACAGTGGGTGGTCCCTCAAGTGAAAGTAGAGAAGACCCGTCACGCCAGACAAGCAAGTGAGGA 

Id 

71 AGAACT TCCAATAAATGACTATACTGAAAATGGAATAGAATTTGATCCCATGC TGGATGAGAGAG 

* r " GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 

fj GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 

||! TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTC TAA TAGGAGGTTTGAGCTCAAATGCTTAAAC 

W TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT 

|Jj GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 

f|| TTATCACCAAAAAAAAAAAAAAAAAAA 
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FIGURE 116 



MAKNPPENCEDCHILNAEAFKSKKICKSLKICGLVFGILALTLIVLFWGSKHFWPEVPKKAYDME 
HTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 
E FSE PEEE I DENEE I T TT FFEQSVI WVPAEKP IENRDFLKNSKI LE I CDNVTMYWINPTL I S VSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCCIYCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVICRVIMPCNWWVARMLGRV 

Important features of the protein: 
Signal peptide: 

amino acids 1-40 

Transmembrane domain: 

amino acids 25-47 (type II) 

N-glycosylation sites. 

amino acids 94-97, 180-183 

Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C- terminal targeting signal . 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 



GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 
GGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCCGGCTGCATCGCGGCCACCGGGATGGACATGTGGAGCACCCAGGACCTGTACGACAACCCCGTCACCT 
CCGTGTTCCAGTACGAAGGGCTCTGGAGGAGCTGCGTGAGGCAGAGTTCAGGCTTCACCGAATGCAGGCCC 
TATTTCACCATCCTGGGACTTCCAGCCATGCTGCAGGCAGTGCGAGCCCTGATGATCGTAGGCATCGTCCT 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
CCAAAGCCAACATGACACTGACCTCCGGGATCATGTTCATTGTCTCAGGTCTTTGTGCAATTGCTGGAGTG 
TCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGG 
GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
T C AC AC TAAT TG GGGGTGTG AT GAT GTG C ATCGC CT GCCGGGGCCT GGCACC AG AAGAAACC AACT AC AAA 
GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 
TGGGTCCAACACCAAAAACAAGAAGATATACGATGGAGGTGCCCGCACAGAGGACGAGGTACAATCTTATC 
CTTC CAAGCACGACT ATGT GTAAT GCTCT AAGAC CT CTCAGC ACGGGC GGAAGAAACT CCCGGAGAGC TC A 
CCCAAAAAACAAGGAGATCCCATCTAGATTTCTTCTTGCTTTTGACTCACAGCTGGAAGTTAGAAAAGCCT 
CGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCCACCATAAAACA 
GCTGAGTT AT TT AT GAATT AGAGGCT AT AGCTCAC ATT TT CAAT CCTC TATT TC TTT T TT T AAAT ATAACT 
TTCTACTCTGATGAGAGAATGTGGTTTTAATCTCTCTCTCACATTTTGATGATTTAGACAGACTCCCCCTC 
TTCCTCCTAGTCAATAAACCCATTGATGAT CT AT TTCCCAGCTTAT CCCCAAGAAAACTTTTGAAAGGAAA 
GAGTAGACCCAAAGATGTTATTTTCTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAA 
AC AC T T AC T G AAGAAGAAGC AAT AAG AGAAAG AT AT TT GT AAT CTC TCC AGCCC ATGATC TCGGT T T T CT T 
ACACTGTGATCTTAAAAGTTACCAAACCAAAGTCATTTTCAGTTTGAGGCAACCAAACCTTTCTACTGCTG 
TT GAC ATCTT CT T AT T ACAGCAAC AC CATT CT AGGAGTTT CC T GAGCT CTCC AC T GG AGT CCT C TT T C TGT 
CGCGGGTC AGAAATT GTCCCT AGAT G AATGAGAAAAT TAT TT TTTT TAATT T AAGT CCT AAAT AT AGT TAA 
AATAAATAATGTTTTAGTAAAATGATACACTATCTCTGTGAAATAGCCTCACCCCTACATGTGGATAGAAG 
GAAAT GAAAAAAT AAT T GCT T T GAC ATT GT CT AT AT GGT ACT T T GT AAAGT CAT GCT T AAGT AC AAAT TC C 
ATGAAAAGCTCACACCTGTAATCCTAGCACTTTGGGAGGCTGAGGAGGAAGGATCACTTGAGCCCAGAAGT 
' TCGAGACTAGCCTGGGCAACATGGAGAAGCCCTGTCTCTACAAAATACAGAGAGAAAAAATCAGCCAGTCA 
TGGTGGCATACACCTGTAGTCCCAGCATTCCGGGAGGCTGAGGTGGGAGGATCACTTGAGCCCAGGGAGGT 
T GGGGCTGCAGTGAGC CATGAT CACACC AC TG CACT CCAGCCAGGT GAC AT AGC GAGATC CTGT CT AAAAA 
AATAAAAAATAAATAATGGAACACAGCAAGTCCTAGGAAGTAGGTTAAAACTAATTCTTTAA 



APP_ID=10063735 



Page 271 of 322 



FIGURE 118 



MSTTTCQWAFLLSILGLAGCIAATGMDMWSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTECRP 
YFTILGLPMLQAVRAmiVGIVLGAIGLLVSIFALKCIRIGSMEDSAKANMTLTSGIMFIVSGL 
CAI AGVS VFANMLVTNFWMS TANMYTGMGOIV QTVQTRYT FGAAL FVGWVAGGL TL I GGVMMC I A 
CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 
V 

Signal peptide : 

amino acids 1-23 

Transmembrane domains: 

amino acids 81-100, 121-141, 173-194 



m 
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FIGURE HQ 



CP 



m 



G GAAAAACT GTTCTCTTCTG T GGCACAGAGAAC C C T GCT T C AAAG C AGAAGTAGCAG TT C C GGAG T C C 
AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 
CATCAGGATGCAGTGCAAAATCTATGATTCCCTGCTGGCTCTTTCTCCGGACCTACAGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACC AG GT G CAC G G G GG ACAAT GAG AAG GT GAAG GC T C AC AT T C T G C T GAC G GC T GG AAT CAT C T T CAT 
CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 
ACTCAATAGTGAATGTTGCCCAAAAACGTGAGCTTGGAGAAGCTCTCTACTTAGGATGGACCACGGCA 
CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
C AGAT AC T C GAT ACC T T CCC ATCGCAC AAC C CAAAAAAG T TAT CAC AC C GG AAAGAAGT CACCGAGC G 
T CT AC TCCAGAAGTCAGTAT GTG TAG TT GTGTATGT t TTT T TAACTTTACT ATAAAGCCATGCAAATG 
ACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT 
AATCTTTVATTACAGGAACTGTGCATCAGCTATTTATGATTCTATAAGCTATTTCAGCAGAATGAGATA 
* T TAAACC C AAT G C T T T GAT T GT T C T AGAAAGT ATAG TAAT TTGTTTTC T AAGG T GGT TCAAGC AT C T A 

.a K& 

!f;f CTCTTTTTATCATTTACTTCAAAATGACATTGCTAAAGACTGCATTATTTTACTACTGTAATTTCTCC 
%l AC GAC ATAGCAT TAT GT AC AT AGAT GAGT GT AACAT T T AT ATC T CAC AT AGAGACAT GC T TAT AT GGT 

&i TTTAT TTAAAATGAAAT GCCAGTC CAT TACACT GAATAAATAGAACTCAACTATTGCTTT TCAGGGAA 

P! ATC AT GGATAGGGTT GAAGAAGGT TACTATTAATT GT TT AAAAACAGC T TAGGGAT TAAT GT C CT CCA 

f|| TTTAT AAT GAAGAT T AAAAT GAAGGCT TT AATCAGCATT GTAAAGGAAAT T GAATGGCTT TCT GAT AT 

GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
CTTGTGTATTAAATTAACATTTTTAAAACGCAGATATTTTGTCAAGGGGCTTTGCATTCAAACTGCTT 
TTCCAGGGCTATACTCAGAAGAAAGATAAAAGTGTGATCTAAGAAAAAGTGATGGTT TTAGGAAAGT G 
AAAAT ATT TTT GTT T TT GTATTTGAAGAAGAAT GATGCAT T TT GACAAGAAATCATATAT GTATGGAT 
ATAT T TTAATAAGTATT TGAGTACAGACT TT GAGGTT TCATCAATATAAATAAAAGAGCAGAAAAAT A 
TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATT TCT GTTTAGT TT TACTAAAATCT GTAAATACT GT AT TT T TCT GTT TAT TCCAAAT T T GAT GAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TATACATTTATATTAATAAATTGTACATTTTTCTAATT 
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FIGURE 120 

MATHALEIAGLFLGGVGMVGTVAVTVMPQ 

IYDSLLALSPDLQAARGLMCAASVMS FLAFMMAILGMKCTRCTGDNEKVKAHILLTAGI I FI ITG 
MWLI PVSWVANAI IRDFYNS I TOVAQKRELGEALYLGWTTALVL I VGGAL FCC VFCCNEKS S S Y 
RYSIPSHRTTQKSYHTGKKSPSVYSRSQYV 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 82-101, 118-145, 164-188 
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FIGURE 121 

GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCT.ee 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GATTCAAAGGAGAAAAGGGGGAAT GT C T GAGGGAAAGCT TTGAGGAGT C CT GGACACCCAACTAC 
AAGCAGTGTTCATGGAGTTCATTGAATTATGGCATAGATCTTGGGAAAATTGCGGAGTGTACATT 
TACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGTTCAGTGGCTCACTTCGGCTAAAATGCA 
GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 
AT T GAAG C T ATAAT T TAT T T GGAC CAAG GAAGC C C T GAAAT GAAT T CAAC AAT T AAT AT T CAT CG 
CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 
TTGGCACTTGTTCAGATTACCCAAAAGGAGATGCTTCTACTGGATGGAATTCAGTTTCTCGCATC 
AT TAT T GAAGAAC TACCAAR ATAAA TGCTT TAAT T T TCAT T TGCTACC TCTTTTTT TAT TATGCC 
T TGGAATGGT TCAC TT AAAT GACAT T T T AAATAAGT T TATG TAT ACAT C T GAATGAAAAGCAAAG 
CTAAAT AT GTT TACAGACCAAAGT GT GATT TCACACTGT TT TTAAATCTAGCAT TAT TCAT T TT G 
CTTCAATCAAAAGTGGTTTCAATATTTTTTTTAGTTGGTTAGAATACTTTCTTCATAGTCACATT 
CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 
AAAAAAT AT AAAAG C T AC C AAT C T T T G T AC AAT T T G T AAAT G T T AAGAAT t T T T T T TAT AT C T G T 
TAAATAAAAAT TAT T T CCAACA 
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FIGURE 122 



MRPQGPAASPQRLRGLLLLLLLQLPAPSSASEIPKGKQKAQLRQREWDLYNGMCLQGPAGVPGR 
DGSPGMFVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNS ALRVL FS GS LRLKCRNACCQRW Y FT FNGAECS GPL P I E AI I YL DQGS PEMNS T INI HRT S 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRIIIEELPK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
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FIGURE 122 



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
C TGGGTGC TCATCACGGGAACT GCTGGGCTATGGAATACAGAT GT GGCAGCTCAGGT AGCCCCAA 
ATTGCCTGGAAGAATACATCATGTTTTTCGATAAGAAGAAATTGTAGGATCCAGTTTTTTTTTTA 
ACCGCCCCCTCCCCACCCCCCAAT^AAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GCCCCTAAGCCAAAGCAAAAGACCTAAGGACGACCTTTGAACAATACAAAGGATGGGTTTCAATG 
TAATTAGGCTACTGAGCGGATCAGCTGTAGCACTGGTTATAGCCCCCACTGTCTTACTGACAATG 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 
AT AAC AGCC TTCAAAAAC T TAAGT AT AAT CAAT T T AAAGGGC TCAACC AGCT CACC T GGC TAT AC 
CTTGACCATAACCATATCAGCAATATTGACGAAAATGCTTTTAATGGAATACGCAGACTCAAAGA 
G C T GAT T C T TAG T T C CAAT AG AAT C T C C T AT T T T C T T AAC AAT AC C T T C AG AC C T G T GAG AAAT T 
TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
AGACTGCCGCAACCTGGAACTTTTGGACCTGGGATATAACCGGATCCGAAGTTTAGCCAGGAATG 
TCTTTGCTGGCATGATCAGACTCAAAGAACTTCACCTGGAGCACAATCAATTTTCCAAGCTCAAC 
CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
CAT AG GACAGAC CATG TCC TGGACCTGGAGCTCC T TACAAAGGCT T GAT T TATCAGGCAATGAGA 
TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
T CCAACAAGCTCACAT TTAT TGGTCAAGAGAT TT TGGATTC TTGGATATCCCTCAATGACATCAG 
TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
TTAAAGGTCTAAGGGAGAATACAATTATCTGTGCCAGTCCCAAAGAGCTGCAAGGAGTAAATGTG 
ATCGATGCAGTGAAGAACTACAGCATCTGTGGCAAAAGTACTACAGAGAGGTTTGATCTGGCCAG 
GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 
TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 
CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 
GGCACAGGAAAAAGAAAAGACAGTCCCTAAAGCAAATGACTCCCAGCACCCAGGAAT TT TATGTA 
GATTATAAACCCACCAACACGGAGACCAGCGAGATGCTGCTGAATGGGACGGGACCC TGCACC TA 
TAACAAATCGGGCTCCAGGGAGTGTGAGGTATGAACCATTGTGATAAAAAGAGCTCTTAAAAGCT 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
GGTC AT TTTCCTCT CATACATAAT CAAC CC AT T GAAAT T T AAAT AC CACAATCAATGTGAAGC T T 
GAACTCCGGTTTAATATAATACCTATTGTATAAGACCCTTTACTGATTCCATTAATGTCGCATTT 
G T T T T AAGAT AAAAC T T C T T T C AT AGG T AAAAAAAAAAA 
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FIGURE 124 



MGFNVIRLLSGSAVALVIAPTVLLTMLSSAERGCPKGCRCEGKMVYCESQKLQEIPSSISAGCLG 
LSLRYNSLQKLKYNQFKGLNQLTWLYLDHNHISNIDENAFNGIRRLKELILSSNRISYFLNNTFR 
PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 
LARNVFAGMIRLKELHLEHNQFSKLNLA^ 

SGNE I EAFS GPS VFQCVPNLQRLNLDSNKLT FI GQE I LDSW I S LND I S LAGN I WECSRN I CSLVN 
WLKSFKGLRENTIICASPKELQGW^IDAVKNYSICGKSTTERFDIARALPKPTFKPKLPRPKHE 
SKPPLPPTVGATEPGPETDADAEHISFHKI IAGSVALFLSVLVILLVI YVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQE FYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 

gAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 465-468 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 125 



% 2 



CCGTTATCGTCTTGCGCTACTGCTGAATCTCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGGTCTTGCTC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
ATTGAAGAGGAAGGCTTTCTAAAGCTTTGGCAAGGAGTGACACCCGCCATTTACAGACACGTAGT 
G TAT T C T G GAGG T C GAAT GG T CACAT ATGAACATC T CCGAGAGG TTGTGTTT GGC AAAAGTGAAG 
ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
T TAGC C AAT C CAAC T GAC C TAG T G AAG G T T C AGAT G C AAAT GGAAG G AAAAAGG AAAC T GG AAG G 
AAAAC CAT T GCGAT T T CGTGGT GT AC AT CAT GC AT T T GC AAAAAT C T T AGCT GAAGGAGGAAT AC 
GAGGGCTTTGGGCAGGCTGGGTACCCAATATACAAAGAGCAGCACTGGTGAATATGGGAGATTTA 
ACCACT TATGAT ACAGTGAAACACTACT T GGTATTGAATACACCACT TGAGGACAATATCATGAC 
TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
T CAAAAGCAGAA.TAAT GAAT CAAC CACGAGATAAACAAGGAAGGGGAC T T T T G TATAAATCAT CG 
ACTGACTGCTTGATTCAGGCTGTTCAAGGTGAAGGATTCATGAGTCTATATAAAGGCTTTTTACC 
ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA 
TGAGTGGAGTCAGTCCATTTTAA 
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FIGURE 126 



MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES 
APYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHWYSGGRMVTYEHLREWFGKSEDEHYPLWKS 
VIGGMmGVIGQFI^PTDLVKVQMQMEGKRKLEGKPLRFRGVHHAFAKII^GGIRGLWAGWVP 
NIQRAALVNMGDLTTYDTVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP 
RDKQGRGLL YKS S TDCL I QAVQGEG FMSLYKGFLP S WLRMT PWSMVFWLT YEKIREMS GVS P F 

Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 
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FIGURE 127 



CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 
CAAAT GAAGAGGACAT C C T C TCCAT G T T C TCAGC TATCCGTT CT CAGCACAGCGG TG TAGACAT C 
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 

licit 

GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA 
,p AGGAGCGGAATGTGGACGATGGGCACATCATTAACATCAATAGCATGTCTGGCCACCGAGTGTTA 
CH CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 
|W GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 
§ „1 AGACACAAT T CGC C TT CAAACTC CACGACAAGGAC CCTGAGAAGGCAGCT GCCAC C TAT GAGCAA 

is..; 

jfi ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 

i CAT CCAGAT T GGAGACATCCAGATGAGGCC CACGGAGCAGGTGACC TAG T GACTGT GGGAGC TCC 

0 TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 

m 

CACGGGATACCACTTCCTGTCCACACCCCGACCAGGGGCTAGAAAATTTGTTTGAGATTTTTATA 

O . 

m TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 

flJ TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 
TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTGCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
CAGAACACCAGGGCCTGGCCCAGTGGATTTCATGGTGATCATTAAAAAAGAAAAATCGCAACCAA 
AAAAAAAAAA 
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FIGURE 128 

MARPGMERWRDRLALVTGASGGIGAAVARALVQQGLKWGCARTVGNIEELAAECKSAGYPGTLI 
PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGLARPDTLLSGSTSGWKDMFNVNVLALSICTR 
EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
I S PGWE TQ FAFKLHDKDPEKAAAT YEQMKCLKPEDVAEAVI YVLS TPAH I QI GD I QMRP TE QVT 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

N-myristoylation sites. 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 
199-205 

Short-chain alcohol dehyrogenase. 

amino acids 30-42, 104-114 
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FIGURE 12Q 



AACTTCTACATGGGCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGTTGAGTGCTTGAAATGAGGAACTGAGAAAATTAATTTCTCATGTATTTTTCTCATTTATTTA 
T T AAT T T TT AAC TGATAGT TGTACATAT T TGGGGGTACAT GT GATATT T GGATACATGTATACAA 
H TAT AT AAT GATCAAATCAGGG TAACT GGGATATCCAT CACATCAAACAT T TAT TT T T TATTCT T T 

If TTAGACAGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGTGCCATCTCAGCTTACTGCAAC 

if ^7. 

S CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 

S s | GCACCACAATGCCCAACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTTGCCATGTTGCCCAGG 

SI CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 

U) GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 

^ 1 ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 

far , ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 

||1 AACCTCTGATCACCTCATTCTACTCTCTACCTCCATGAGATCCACTTTTTTAGCTCCCACATGTG 

Q AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 

Si TTCCATCCATGTTGCTGCAAATGACAGGATTTCGTTCTTAATTTCAATTAAAATAACCACACATG 

H gcaaaaa 
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FIGURE 13Q 

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGESPMPFKRVFCQDGNVRSFCVCAVHFSSHQPPVAVECLK 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

amino acids 86-92 

Zinc carboxypeptidases , zinc-binding region 2 signature. 

amino acids 68-79 
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FIGURE 121 



T T CT GAAGT AAC GG AAGCT ACCTT GT AT AAAG AC CT C AAC ACT GCT GACC ATGAT C AGCGC AGC CT GGAGC 
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
TCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACTGTAATGATCGCTTTCTGACATCCATTCCAACAG 
GAAT AC C AGAGG AT GCTAC AACTC TC T ACC T TC AGAACAACC AAAT AAAT AAT GCT GGGATTCC TT CAGAT 
T T GAAAAACT TGCT GAAAGT AG AAAGAAT AT ACC T AT ACC AC AACAGT T TAG AT GAAT TT CC T ACC AACCT 
CC CAAAGT AT GT AAAAGAGT T ACAT TTGCAAGAAAAT AAC AT AAGG ACT ATC AC TT ATGATTCACT TT C AA 
AAATTCCCTATCTGGAAGAATTACATTTAGATGACAACTCTGTCTCTGCAGTTAGCATAGAAGAGGGAGCA 
TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 
GCCCAGGACTATAGAAGAACTACGCTTGGATGATAATCGCATATCCACTATTTCATCACCATCTCTTCAAG 
GTCTCACTAGTCTAAAACGCCTGGTTCTAGATGGAAACCTGTTGAACAATCATGGTTTAGGTGACAAAGTT 
TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 
TCCAGGCACAAACCT GAG G AAGCT T TAT CT TC AAGAT AAC CACATC AAT CGGGTGCCCCC AAAT GCTT TTT 
CT T AT CTAAGGC AGCT CT AT CGACT GGATATGT C CAAT AAT AACCT AAGTAATT TACCTCAGGGT AT CTTT 
GATG AT TT GGAC AAT ATAAC ACAACTG ATT CTT C GC AACAAT CCCT GGT AT T GC GGGT GC AAGATG AAAT G 
GGTACGTGACTGGTTACAATCACTACCTGTGAAGGTCAACGTGCGTGGGCTCATGTGCCAAGCCCCAGAAA 
AGGT TCGT GG GATGGCTAT T AAGGATCT CAATGC AGAACT GT T T GATT GT AAGG ACAGTGGGAT TGT AAGC 
ACCATT CAGAT AAC CACT GC AATACC C AAC AC AGTGT AT CCT G CCCAAGGAC AGTGGCCAGCTC C AGT GAC 
CAAACAGC CAGAT ATT AAGAACCCC AAGCT CACTAAGGAT C AAC AAACC AC AGG GAGT CCCTCAAGAAAAA 
CAATTACAATTACTGTGAAGTCTGTCACCTCTGATACCATTCATATCTCTTGGAAACTTGCTCTACCTATG 
AC TGCTTT GAGACT C AGCT G GCTT AAAC TG GGCC AT AGCC CG G CAT TT GGAT CT AT AACAGAAAC AAT TGT 
AACAGGGG AACGCAGT GAGT ACTT GGT C AC AGCC CT GGAGCC T GAT TC ACCC T AT AAAGT ATGC AT GGTT C 
CCATGGAAACCAGCAACCTCTACCTATTTGATGAAACTCCTGTTTGTATTGAGACTGAAACTGCACCCCTT 
CGAATGT AC AACCCT AC AAC C ACCCT C AAT CG AG AGC AAG AG AAAGAACCTT AC AAAAAC CCC AAT T T AC C 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
T T CAT AGG AATGGATC GCT CTT CT C AAGGAACT GTGC AT AT AG C AAAGGGAGGAGAAG AAAGGAT GAC TAT 
GCAGAAGCTGGCACTAAGAAGGACAACTCTATCCTGGAAATCAGGGAAACTTCTTTTCAGATGTTACCAAT 
AAGCAATG AACCCAT CTCG AAG GAGG AGTT TGT AAT AC AC AC C ATATT TCCT CCT AAT GGAAT G AAT CT GT 
ACAAAAACAATCACAGTGAAAGCAGTAGTAACCGAAGCTACAGAGACAGTGGTATTCCAGACTCAGATCAC 
T C ACAC TC ATGATGCT GAAGGACT C AC AGC AGAC TT GT GT TT TGGGTT TTTT AAACC TAAGGGAGGTGATG 
GT 
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FIGURE 132 

MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQI NNAG I PS DLKNLLKVER I YL YHNS LDE FPTNL PKYVKELHLQENNI RT I T YDS L SKI P YL 
EELHLDDNSVSAVS IEEGAFRDSNYLRLLFLSRNHLS T I PWGLPRT I EELRLDDNR ISTISSPSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRLDMSNNNLSNLPQGIFDDLDNITQLILRNNPWYCGCKMKWRDWLQSLPV 
KVNVRGLMCQAPEKVRGMAI KDLNAE L FDCKDS G I VS T I Q I TTAI PNTVYPAQGQWPAPVTKQPD 
IKNPKLTKDQQTTGS P S RKT I T I TVKS VTS DT I H I S WKLAL PMTALRLS WLKLGHS PAFGS I TET 
IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
PYKNPNLPLAAI I GGAVAL VT I ALLALVC WYVHRNGS L FSRNCAYSKGRRRKDDYAEAGTKKDNS 
ILEIRETSFQMLPISNEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS 

Important features of the protein: 
Signal peptide: 
amino acids 1-28 
Transmembrane domain: 
amino acids 531-552 
N-glycosylation sites. 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 
Tyrosine kinase phosphorylation site, 
amino acids 515-522 
N-myristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 
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FIGURE 133 



CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCT-TC ATGT CTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
TGCAGAGGCAGTCTGGGCTTGGCCAGAGCTCAGGGTGCTGAGCGTGTGACCAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
H TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
Zll CTCCACTACCTCAAGCTGAGTGATCCCAAGTACCTAAGAGAGTTCCAGCTGACCCTCCAGCCCGG 
ftl GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
fij CC CAGGACT CAT T C T CAGAGGAGAGAAG TGACGT GTGCC TGGT GCAGC T GCTGGGAACC GGGAC G 

SJ GACAGCAGCGAGCCCTGCGGCCTCTCAGAGCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 
^ AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
* M GACCACTCCAACAGAGCCAGGAC TAT AT CAACC TC TTC TGCGCCAACAT GAT GGAC T TGAACCGC 

f=l AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 
HI TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
AACAGCAGGAAGGATGCTTCGGGGAGCCTGATGCTGAAGATGAAGAATTATCTAAAGCTATTCAA 
TAT CAGCAGCAT T T T T CGAGGAGAGTGAAGAGGC GAGAAAAACAAT TT C CAGAT TCTCGCTCTGT 
TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 
TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATT ATAG GTGTGAGCCACCGTGTCTG 
GCTGAAAAGCACTTTCAAAGAGACTGTGTTGAATAAAGGGCCAAGGTTCTTGCCACCCAGCACTC 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GTGGCTTCCTATACATCCTGGCAGAATACCCCCCAGCAAACAGAGAGCCACACCCATCCACACCG 
CCACCACCAAGCAGCCGCTGAGACGGACGGTTCCATGCCAGCTGCCTGGAGGAGGAACAGACCCC 
TTTAGTCCTCATCCCTTAGATCCTGGAGGGCACGGATCACATCCTGGGAAGAAGGCATCTGGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGACCCAGGTGTGAACGGATGAATAAAGTTCAACTGCAAGTGAAAAAAAAAAA 
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FIGURE 13 4 

MSARGRWEGGGRRACRGSLGLARA.QGAERVTSSEQRPAMASLGLLLLLLLTALPPLWSSSLPGLD 
TAESKAT I ADL I LSALERAT VFLEQRLPE I NLDGMVGVRVLEEQLKS VREKWAQE PLLQPLS LRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
YINLFCANMMDLNRRAEAIGYAYPTRDIFMENIMFCGMGGFSDFYKLRWLEAILSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 39-56 

Tyrosine kinase phosphorylation sites. 

amino acids 149-156, 274-282 

N-myristoylation sites. 

amino acids 10-16, 20-26, 63-69, 208-214 

Ami da ti on site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1. 

amino acids 230-237 
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FIGURE 13* 



GGTCTGAGTGCAGAGCTGCTGTCATCGCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGCTTACAGATTTGATCCCGTTCGAGTGGATATCACTTCGAAAGGAAAAATGAGAGCAAGATATG 
TGAAT T ACATCAAAACATCAGAGG T T G T CAGAC TGCCC TAT CCT CT CCAAAT GAAATC T TCAGGT 
^ CCACC T TC T TAC T T TAT T AAAAGGGAAT CGT GGGGC T GGACAGAC T T T C TAATGAACCCAAT GGT 

Cl 

III TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 

it * 

tn CTGACATGAGACGGGAAATGGAGCAGTCAATGAATATGCTGAATTCCAACCATGAGTTGCCTGAT 

•k: ! * 

hi GT T TC TGAGT TCATGACAAGACT CT T C T C T TC AAAAT CATC TGGCAAAT CT AGCAGCGGCAGCAG 

H TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAGGTAGTCAGGCCGTCCAGAGCTGGCATTTGCAC 

Jyf AAACACGGCAACACT GGGT GGCATCCAAGT C T TGGAAAACCGT GT GAAGCAAC TACTATAAAC T T 

ill 

GAGTCATCCCGACGTTGATCTCTTACAACTGTGTATGTT 
f s l AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 
||| GAGG TCAATAT TGAT GTCAC T GAAT TAAT TACAGTGTC CTATAGAAAAT GCCAT TAATAAAT TAT 

P AT GAACTAC TATACAT TAT GTATAT TAAT T AAAACAT C TTAATCCAGAAAT CAAAAAAAAAAAAA 

AAAAAAAAAAAAAAA 
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FIGURE 136 

MAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAWPGVKPQDWISAA 
RVLVDGEEHVGFLKTDGS FWHD I PSGSYWEWSPAYRFDPVRVDI TSKGKMRARYVNYIKTSE 
VVRLPYPLQMKSSGPPSYFIKRESWGWTDFLMNPMVMM^^ 
QSMNMLNSNHEL P DVS E FMT.RL FS SKS.S GKS S S G S SKTGKS GAGKRR 

Important features of the protein: 
Signal sequence: 

amino acids 1-23 

Transmembrane domain: 

amino acids 161-182 

N-glycosylation site. 

amino acids 184-187 

Glycosaminoglycan attachment sites. 

amino acids 37-40, 236-239 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 151-154 

N-myristoylation sites . 

amino acids 33-38, 36-41, 38-44, 229-234 

Amidation site. 

amino acids 238-241 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 137 

GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACCAAC T CGCAC T CAGAC TCT GAAC T C AGACC TGAAAT C T TC T CT T C ACGGGAGGC T T GGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTGAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
AAGCT GT GT GAT CGCCACAAACC T TCAGGAAAT ACGAAAT GGAT T T TC TG AGATACGGGGCAGT G 
T GCAAGCCAAAGATGGAAACAT T GACATCAGAATC TTAAGGAGGAC TGAGTCTT TGCAAGACACA 
AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
AAAC TACCAGACCCC TGACCATTAT AC TCT C CGGAAGATCAGC AGCCT C GCCAAT TCCTTTCTTA 
CCATCAAGAAGGACCTCCGGCTCTCTCATGCCCACATGACATGCCATTGTGGGGAGGAAGCAATG 
AAGAAATACAGCCAGATTCTGAGTCACTTTGAAAAGCTGGAACCTCAGGCAGCAGTTGTGAAGGC 
TTTGGGGGAACTAGACATTCTTCTGCAATGGATGGAGGAGACAGA ATAGG AGGAAAGTGATGCTG 
CTGCTAAGAATATTCGAGGTCAAGAGCTCCAGTCTTCAATACCTGCAGAGGAGGCATGACCCCAA 
ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 
CT TCCT TGCATGAT TGTC TT TATGCATCCCCAATC TTAAT TGAGACCATACT TGTATAAGAT TT T 
TGTAATATCTTTCTGCTATTGGATATATTTATTAGTTAATATATTTATTTATTTTTTGCTATTTA 
AT GT AT T TAT T T T T T T AC T T GGACAT GAAAC T T T AAAAAAAT T C ACAG AT TAT AT T TAT AAC C T G 
AC TAGAGCAGGT GATGTAT T T T T AT ACAGT AAAAAAAAAAAACC T TG T AAAT TCTAGAAGAG TGG 
C T AGGGGGG T TAT TCAT T T G TAT T CAAC T AAGGAC AT AT TTACT CAT GC TGAT GC TC TGTGAGAT 
AT T TGAAAT T GAACCAATGACTAC T TAGGAT GGG T TG TGGAATAAGT T T TGAT GT GGAAT TGCAC 
ATCTACCTTACAATTACTGACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTATCTTCCAG 
C CAGGAATC CTACAC GGCCAGCAT GTAT TT CTACAAATAAAGT T T TCT T TGCATACCAAAAAAAA 
AAAAAAAAAAA 



APP ID=10063735 



Page 291 of 322 



FIGURE 138 

HRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 
SSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNG 
FSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKIS 
S LANS FLT I KKDLRLSHAHMTCHCGEEAMKKYS Q I L S H FEKLE PQAAWKALGELD ILLQWMEE T 
E 

Important features of the protein: 
Signal peptide: 

amino acids 1-42 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 192-195, 225-228 

N-myristoylation sites . 

amino acids 42-47, 46-51, 136-141 
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FIGURE 12Q 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACCTCTCTTCAGTAAAGTTGTTATTGTTCTGATAGATGCCTTGAGAGATGATTTTGTGTTTG 
GG T CAAAGGG T GT GAAATT TATGCCC TACACAAC T TAC C TTGTG GAAAAAG GAGCAT C TCACAG T 
TTTGTGGCT GAAGCAAAGC CACCT ACAGT TACT ATGCC TCGAAT CAAGGCAT T GATGACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
T AAGACAAGCAAAAGCAGCT GGAAAAAGAAT AGTC TT TT AT GGAGATGAAAC C1GGG T TAAAT T A 
T T CCCAAAGCAT T T TGT GGAAT AT GAT GGAACAACC T CATT TTT C GTG TC AGAT TACACAGAGGT 
GGAT AATAAT GT CACGAGGCAT T T GGATAAAG TAT T AAAAAG AG GAG AT T G G G AC AT AT TAATCC 
TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
AAGCT GAGC GAGATGGACAGC GT GCT GAT GAAGAT CCACACCTCACT GCAGT CGAAGGAGAGAGA 
GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA 
CCCGGTGATATCCGACATCCAAAGCACGTCCAATAGACGGATGTGGCTGCGACACTGGCGATAGC 
ACTTGGCTTACCGATTCCAAAAGACAGTGTAGGGAGCCTCCTATTCCCAGTTGTGGAAGGAAGAC 
CAATGAGAGAGCAGTTGAGAT TT T TACATT TGAAT ACAGT GC AG CTTAGTAAAC TGT TGCAAGAG 
AATGTGCCGTCATATGAAAAAGATCCTGGGTTTGAGCAGTTTAAAATGTCAGAAAGATTGCATGG 
GAACT GGATCAGAC TGTAC T T GGAGGAAAAGCATT CAGAAGT CC TATT CAAC C TGGGCT CCAAGG 
TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 
T T CACAT CT GTGC T GT CAGGT CAGATGCCT CAGT T C TT GGAAAGC T AGG T TC C TGCGAC TG TTAC 
CAAGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA 
TCGGACAGCC TCCCAGCAGAGGT G T GGGAGCTGCAGCT GAGGGAAGAAGAGACAATC GGCC TGGA 
CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACCCCCAGAATGCATCCT 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
ACTCAGATCCACAGAGCCCAGGATCAAGGGACCCACTGCAGTGGCAGCAGGACTGTTGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
CTGACCGAGACAC TCACAGCT TT GTCAT CAGGGCACAGGCTTCCTCGGAGCCAGGATGATC TGTG 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
CTGCACACAG TAT GTAGT TAC CAAAAGAAT AAACGGC AATAAT T GAG AAAAAAAA 
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FIGURE 14Q 



MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPVRSSARAEHGAEPPAPEPSAGASSNWTTLPPPLF 
SKWIVLIDALRDDFVFGSKGVKFMPYTTYLVEKGASHSFVAEAKPPTVTMPRIKALMTGSLPGF 
VDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWVKLFPKHFVEYDGTTSFFVSDYTEVDNNV 
TRHLDKVLKRGDWDIL ILHYLGLDHI GHI SGPNS PL I GQKLSEMDSVLMKI HT SLQSKERE TPLP 
NLLVLCGDHGMSETGSHGASSTEEVNTPLILISSAFERKPGDIRHPKHVQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-34 

£K r. 

|gl Transmembrane domain: 

amino acids 58-7 6 

N-glycosylation sites. 

amino acids 56-60, -194-198 

N-myristoylation sites. 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276,. 
275-281, 278-284 

Amidation site. 

amino acids 154-158 

Cell attachment sequence. 

amino acids 205-208 
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FIGURE 141 



GGCACGAGGCAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTCTGAGAGCTACTGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTG ATG TTACTGCTGCTGTTGGAGTACAACT 
T C C C T AT AGAAAACAAC TGCCAGCACC T T AAGACCAC TCAC ACC T T CAGAG T GAAGAAC T T AAAC 
CCGAAGAAAT TCAGC AT TCATGACCAGGAT CACAAAGTAC T GG T CC T GGAC T C T GGGAAT C T CAT 
AGC AG T T C C AGAT AAAAAC TACAT ACGCCCAGAGATC T T C T T T GC AT T AGC C T CATCCT T GAGC T 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
T GT GACAAGGATAAAGGACAAAGT CAT CCATCCC TTCAGC T GAAGAAGGAGAAAC TGAT GAAGCT 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
G T T GGGGTGACAGATAAAT T TGAGAACAGGAAACACAT TGAAT T T T CAT T T C AACCAG T T TGCAA 
AGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAGGAAACTGCCCCATTGAACGCCTTCCTCGCTA 
ATTTGAACTAAT TGTATAAAAACACCAAACCTGC TCAC T 
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FIGURE 142 



MLLLLLE YNFP I ENNCQHLKTTHT FRVKNLNPKKFS I HDQDHKVLVLDSGNL I AVPDKNY IRPEI 
F FALAS S L S S AS AEKG S P I LLGVS KGE FCL YCDKDKGQS HPS LQLKKEKLMKLAAQKE S ARR P F I 
FYRAQVGSWNMLESAAHPGWFICTSCNCNEPVGVTDKFENRKHIEFS FQPVCKAEMSPSEVSD 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36 

N-myristoylation site. 

amino acids 50-55, 87-92 

Interleukin-1 

amino acids 37-182 
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FIGURE 143 



C T AGAGAGT AT AGGGCAGAAGGAT GGCAGAT GAGTGAC TCCACAT CCAGAGCTGCC TCCCT T T AA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAGTGGAGT TT GCTGTCCACACAT TCAACCAACAGAGCAAGGACTACTATGCC TACAGACTGGGG 
1^ CACATC TT GAAT TCC TGGAAGGAGCAGGTGGAGTCCAAGACTGTATT CTCAATGGAGC T AC TGCT 

13 GGGGAGAACT AGGTGT GGGAAAT T T GAAGACGACAT T GACAACT GCCAT T T C CAAGAAAGCACAG 

Q AGCTGAACAATACT T TCACCTGCTT CT TCACCATCAGCACCAGGCCCT GGATGACTCAGTTCAGC 

J* t CT CCT GAACAAGACC T GC T TGGAGGGAT T C CAC TGAGTGAAACCCACT CACAGGC T TGTCCAT GT 

H GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 
yj TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 
|fl AC ACAT C T T G AGC C T AAT CAT G TAG T G TAG AT CAT T AAAC AT CAG CAT T T TAAGAAAAAAAAAAA 

5 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 

w 

;: 

Ui 
w 
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FIGURE iaa 



MLGLPWKGGLSWALLLLLLGSQILLIYAWHFHEQRDCDEHNVMARYLPATVEFAVHTFNQQSKDY 
YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQESTELNNTFTCFFTISTRP 
WMTQFS LLNKTCLEG FH \ 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 



jpj N-glycosylation sites. 

Cl amino acids 117-121, 139-143 



N-myristoylation site. 

amino acids 9-15 
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FIGURE 145 



CT GTGCAGC T CGAGGCT CCAGAGGCACACT CCAGAGAGAGC CAAGG T T C TGACGC GATGAGGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAATCAAGTGGAACCGGAAGGCCCTGCCCAGCACTGCCCAGATCA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACTTCGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACTGGCAGTTCCCCGATGGCATCCA 
CTACAACGGC TGC T C TGAGGCTAAT GT GAC CAAGGAGGCAT T T GT CAC C GGCT GCAT CAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGTTCCAGAAGCCAGACAACAAGCTCCACCAGCAGGTGCTCTGG 
jU ? CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 

D TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 

A 

AAG C T T G C C AGGAG GC T GGCAG T AC AGAG C G C AGC AG C GAGCAAAT C C T G GC AAG T GAC C CAGC T 
™ I CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
iy TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 
III CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTAAATGCTTGAT 
111 GAGAAGAACACAT CAGGCAC TGCGCCACCT GC T T CACAGTACT TCCCAACAAC TC T TAGAGG TAG 

GT GTAT T CCCGT T T TACAGATAAGGAAAC T GAGGCCCAGAGAGCT GAAGTAC TGCAC CCAGCAT C 
ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 
GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT 
CT CTAAT GAAAT T GTGAAAGC TC CATG T TT AGAAATAAATGAAAACAC C T GA 



I., 5 
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FIGURE 146 



MRKHLSWWWLATVCMLLFSHLSAVQTRGIKHRIKWNRKALPSTAQITEAQVAENRPGAFIKQGRK 
LDIDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGCINATQAANQGEFQKPDNKLHQQ 
VLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLIALIWLMVK 



Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain : 

amino acids 157-171 

N-glycosylation sites . 

amino acids 98-102, 110-114 

Tyrosine kinase phosphorylation site. 

amino acids 7 6-83 

N-myristoylation sites. 

amino acids 71-77, 88-94, 93-99, 107-113, 154-160 



Ami da ti on site. 

amino acids 62-66 
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FIGURE 147 



GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTCATGTTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAACTTTCAGATTCAGGGGGTACATGTGAAGGTTTGTTTTATGAGTATATTGCATGATGCTGAGG 
TTTGGGGT 
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FIGURE 148 



MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFY FLKFQLSDSGGTCE 
GLFYEYIA 



Important features of the protein: 
Signal peptide: 

amino acids 1-25 



jpsji N-myristoylation site. 

hsl amino acids 62-68 

O 

m 

H 
w 
m 

8 

© 

HI 

o 

CO 

o 

fij 
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FIGURE 14Q 



GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCT CCAGGAGCCCAGCGCC GGGCT GTGAC CCAAGC CGAGCGTGGAAGAATGGGGT T 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAAT CT C TACATAATAGAGAATTAAGT G CAGAAAGACCTTTGAAT GAACAGATTGCT GAA 
GCAGAAGAAGACAAGAT TAAAAAAACATAT CCTCCAGAAAACAAGCCAGGT CAGAGCAACTAT TCTT T 
T GTT GATAACT TGAACCTGCTAAAGGCAATAACAGAAAAGGAAAAAAT TGAGAAAGAAAGACAATCT A 
TAAGAAGCTCCCCACTTGATAATAAGTTGAATGTGGAAGATGTTGATTCAACCAAGAATCGAAAACTG 
ATCGATGATTATGACTCTACTAAGAGTGGATTGGATCATAAATTTCAAGATGATCCAGATGGTCTTCA 
TCAACTAGACGGGACTCCTTTAACCGCTGAAGACATTGTCCATAAAATCGCTGCCAGGATTTATGAAG 
AAAATGACAGAGCCGTGTTTGACAAGATTGTTTCTAAACTACTTAATCTCGGCCTTATCACAGAAAGC 
CAAGCAC ATACACTG GAAGAT GAAGTAGCAGAGGTTTTACAAAAATTAATC T CAAAGGAAGCCAACAA 
T TAT GAGGAGGATCCCAATAAGCCCACAAGCT GGACTGAGAA.TCAGGCTGGAAAAAT ACCAGAGAAAG 
T GACT CCAATGGCAGCAATT CAAGATGGTCT TGCTAAGGGAGAAAACGATGAAACAGTAT CTAACACA 
s . T TAACCT T GACAAAT GGCTT GGAAAGGAGAACTAAAACCTACAGT GAAGACAACTTT GAGGAACTCCA 

C? ATATT TCCCAAATTT CTATGCGCTACT GAAAAGTATT GATT CAGAAAAAGAAGCAAAAGAGAAAGAAA 

yj CACTGAT T ACT ATCATGAAAACACT GATTGACTTTGT GAAGAT GAT GGTGAAATATGGAACAAT AT C T 

C CAGAAGAAGGT GT T T C C T AC C T T GAAAACT T G GAT GAAAT GAT T GC T CT T CAGAC CAAAAAC AAGC T 
AGAAAAAAATGCTACTGACAATATAAGCAAGCTTTTCCCAGCACCATCAGAGAAGAGTCATGAAGAAA 
CAGAC AGTACCAAGGAAGAAGCAGCTAAGAT GGAAAAGGAATAT GGAAGCT TGAAGGATT CCACAAAA 
GATGATAACTCCAACCCAGGAGGAAAGACAGATGAACCCAAAGGAAAAACAGAAGCCTATTTGGAAGC 
CATCAGAAAAAATATTGAATGGTTGAAGAAACATGACAAAAAGGGAAATAAAGAAGATTATGACCTTT 
CAAAGATGAGAGACTTCATCAATAAACAAGCTGATGCTTATGTGGAGAAAGGCATCCTTGACAAGGAA 
GAAGCCGAGGCCATCAAGCGCATTTATAGCAGCCTGTAAAAATGGCAAAAGATCCAGGAGTCTTTCAA 
CTGTTTCAGAAAACATAATATAGCTTAAAACACTTCTAATTCTGTGATTAAAATTTTTTGACCCAAGG 
GTTATTAGAAAGTGCTGAATTTACAGTAGTTAACCTTTTACAAGTGGTTAAAACATAGCTTTCTTCCC 
GTAAAAACTATCTGAAAGTAAAGT TGTAT GTAAGCTGAAAAAAAAAAAAAAAAAAAA 



W 
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FIGURE ISO 

MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENKPG 
QSNYSFVDNLNLLKAITEKEKIEKERQSIRSSPLDNKLNVEDVDSTKNRKLIDDYDSTKSGLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEVAE 
VLQKLISKEANNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKT YSEDNFEELQYFPNFYALLKS I DSEKEAKEKE TL I T IMKTLI DFVKMMVK YGT I S PEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 
EEAEAIKRIYSSL 

N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 

220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 

N-myristoylation sites: 

amino acids 143-148, 239-244 
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FIGURE 151 



CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 
GATGGTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
ATAATAACGAGCTTCTAGCTGGAGGGCTGCATGCAGGGAAGGTCATTAAAGGTGAAGAGATCAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGTGCCTGTCATGTGGGGTGGGGCAGGAGCCGAGTCTAACACTAGAGCCAGTGAACATCATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTG7yVGCCGATCAGCC 
TGTCAGACTCACCCAGCTTCCCGAGAATGGTGGCTGGAATGCCCCCATCACAGACTTCTACTTCC 
AGCAGTGTGACTAGGGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGTGGAGGAGACCCATGGCGGACAATCACTCTCTCTGCTCTCAGGACCCCCACGTCTGACTTAG 
TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
y„ CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
!I CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
y TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 
Q GGGTCAGTAGCTCTCCACATGAAGTCCTGTCACTCACCACTGTGCAGGAGAGGGAGGTGGTCATA 
|f! GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
1 , 5 TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 
^1 GAAATGGCT CGAGCTCAGAAGATAAAAGATAAGTAGGGTATGCTGATCCT CTT T TAAAAACCCAA 

GATACAATCAAAAT CCCAGATGCT GGT C T C T AT TCCCAT GAAAAAGTGC T CATGACATAT TGAGA 
UJ AGACC T AC T TAC AAAG T GGC AT AT AT T GC AAT T TAT T T T AAT T AAAAGAT AC C TAT T TAT AT AT T 

HI TCTTTATAGAAAAAAGTCTGGAAGAGTTTACTTCAA.TTGTAGCAATGTCAGGGTGGTGGCAGTAT 
1 ' AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA 
ATTCCTTGTATAAAAATAAGAAAAGAAATTAATCTTGAGGTAAGCAGAGCAGACATCATCTCTGA 
W TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
l?l TTGTAGTAGTGATCAGGAAACAGATCTCAGCAAAGCCACTGAGGAGGAGGCTGTGCTGAGTTTGT 
Q GTGGCTGGAATCTCTGGGTAAGGAACTTAAAGAACAAAAATCATCTGGTAATTCTTTCCTAGAAG 
f|| GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTAAGAAGGCTGCTGTACTGGTTGA 
3 ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 
y GTCTCTGCAGATGTAGTTAGTTAAGACAAGGTCATGCTGGATGAAGGTAGACCTAAATTCAATAT 
III GACTGGTTTCCTTGTATGAAAAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGACTATGTA 
AAGAT GAAGGCAGAGAT CGGAGT T TTGC AGCCACAAGC TAAGAAACACCAAGGAT TG TGGCAACC 
ATCAGAAGCT TGGAAGAGGCAAAGAAGAAT TC TTCCC T AGAGGCT TTAGAGGGATAACGGC TC TG 
C TGAAACCT T AAT CTCAGAC T TCCAGC C TCC T GAACGAAGAAAGAAT AAAT T T CGGC TGT T TTAA 
G C C AC C AAGG AT AAT T G G T TAC AG C AG C T C TAG GAAAC T AAT AC AGC T G C T AAAAT GAT C C C T G T 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAATATGGCAGAAGTGATGGCATGCCACTTCCAAGATTAGGTTATAAAAGACACTGCAGCTTC 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGCTAGCTGCCATGCTATGAGCAGGCCTATAAAGAGACTTACGTGGTAAAAAATGAAGTCTCCT 
GC CCACAGCCACAT TAGTGAACC TAGAAGCAGAGAC T C TGT GAGATAAT C GAT GTTTGTTGTTTT 
AAG T T G C T CAG TTTTGGTC TAAC T T GT TAT G CAGC A&.T AGATAAAT AAT AT G C AGAGAA&GAG 
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FIGURE 152 



i n 



MVLSGALCFRMKDSALKVLYLHMQLLAGGLHAGKVIKGEEISWPNRWLDASLSPVILGVQGGS 
QCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAPITDFYFQQCD 

N-myristoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-63, 73-78, 91-96, 106-111 

Interleukin-1 signature. 

amino acids 111-131 

Interleukin-1 proteins. 

amino acids 8-29, 83-120, 95-134, 64-103 
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FIGURE isa 



C T T CAGAACAGGT TC T CC T TCCCCAGT CAC CAG T TGC T CGAGT T AGAAT TG T C T GC AATGGCCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 
CAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTATGAGTGAGCGCTGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
TAT T GAAGGTGATGACCTGCATATCCAGAGGAAT GTGCAAAAGCT GAAGGACACAGTGAAAAAGC 
T T GGAGAGAGTGGAGAGATCAAAGCAAT T GG AGAAC T GGAT TTGCTGTT TAT GTC TC TGAGAAAT 
GC CT GCAT T TGACCAGAGCAAAGCT GAAAAATGAATAAC TAACCC C CT T TCCC TGC TAGAAATAA 
CAAT TAGATGCC CCAAAGC GATT T T T T T T AACCAAAAGGAAGAT GG GAAGCCAAAC TCCATCATG 
ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA 
AGACCAGAAGGTAGACT TTCTAAGCATAGAT AT TTATTGAT AACAT T TCAT T GTAACTGGTGTTC 
TATACACAGAAAACAAT TTATTTT TTAAATAATTGTC TTTT TCCATAAAAAAGATTACTTTCCAT 
TCCTTTAGGGGAAAAAACCCCTAAATAGCTTCATGTTTCCATAATCAGTACTTTATATTTATAAA 
T GT AT T TAT TAT TAT TATAAGAC T GCAT T T TAT T TAT AT CAT T T TAT T AATAT GGAT T T AT T TAT 
AG AAAC AT CAT T C GAT AT T GC T AC T T GAG T G T AAGGC T AAT AT T GAT AT T TAT GAC AAT AAT TAT 
AGAGC TAT AACAT GT T TAT T TGACCT CAAT AAACAC T TGGATAT CCC 



m 
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FIGURE 154 

MAALQKS VS S FLMGTLATSCLLLLALLVQGGAAAP I S SHCRLDKSNFQQPY I TNRT FMLAKEASL 
ADNNTDVRLIGEKLFHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEWPFLARLSNRLS 
TCHIEGDDLHIQRNVQKLKDTVKKLGESGEIKAIGELDLLFMSLRNACI 

Important features of the protein: 
Signal peptide: 
amino acids 1-33 

N-giycosylation sites. 

amino acids 54-58, 68-72, 97-101 

N-myristoylation sites. 

amino, acids 14-20, 82-88 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 10-21 
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FIGURE 155 



GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCCGACTTGTGACTGAGTGTGCAGTGCCCAGCATGTACCAGGTCAGTGCAGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGATTAGGTGAGGACAGTTCTCTCATTAGCCTTTTCCTACAGGTGGTTGCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
GCTGCTCTACCACAACCAGACTGTCTTCTACAGGCGGCCATGCCATGGCGAGAAGGGCACCCACA 
AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
GTGATGGGCTAGCCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 
ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 
, g CAGGATCCCGGGACAGGATGGGGGGCTTTGGGGAAAACCTGCACTTCTGCACATTTTGAAAAGAG 
13 CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 
"111 GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 

?:! GC C CAGCACAGGCACT T TC TAGATAT TTCCCCCTT GC T GGAGAAGAAAGAGCC CC TGG T T TTAT T 

CO 

H TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 
?3j T T GACAT G GAT GAT T C T GAG GAG GAAG C T G T TAT T GAAT GT AT AGAGAT T TAT C C AAAT AAAT AT 

CT TT AT TT AAAAATGAAAAA 
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FIGURE 156 

MRERPRLGEDSSLISLFLQWAFLAMVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA 
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide: 

amino acids 1-32 

N-glycosylation site. 

amino acids 136-140 

Tyrosine kinase phosphorylation site. 

amino acids 127-135 

N-myristoylation sites. 

amino acids 44-50, 150-156 
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FIGURE 157 



CCGGCG ATG TCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
CCGGAGACT TGAGGGACC TCCGAGTAGAACCTGTTACAACTAGTGTTGCAACAGGGGAC TAT TCA 
AT T TT GATGAATGTAAGC TGGGTACTCCGGGCAGATGCCAGCATCCGCTTGT TGAAGGCCACCAA 
GATTTGTGTGACGGGCAAAAGCAACTTCCAGTCCTACAGCTGTGTGAGGTGCAATTACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAGCTGAACACAGTCTATTTCATTGGGGCCCATAATATTCCTAATGCAAATATGAATGAAGATGG 
s , CCC TT CCAT GTC TGTGAAT T T CAC C T CAC CAGGC T GCC TAGACCACATAATGAAATATAAAAAAA 

Q AGTGTGTCAAGGCCGGAAGCCTGTGGGATCCGAACATCACTGCTTGTAAGAAGAATGAGGAGACA 
0 GTAGAAGTGAACTTCACAACCACTCCCCTGGGAAACAGATACATGGCTCTTATCCAACACAGCAC 
f I TATCATCGGGTTTTCTCAGGTGTTTGAGCCACACCAGAAGAAACAAACGCGAGCTTCAGTGGTGA 
TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
fsj AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
* CAT GGG TGCT GGTGGCAGGGATCT AT CTAAT GTGGAGGCACGAAAGGAT CAAGAAGAC T T CC T T T 

;:f TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 

10 

«* TCACACAATTTGTTACTTCACTGAATTTCTTCAAAACCATTGCAGAAGTGAGGTCATCCTTGAAA 
f&. AGTGGCAGAAAAAGAAAATAGCAGAGATGGGTCCAGTGCAGTGGCTTGCCACTCAAAAGAAGGCA 
"0 GCAGACAAAGTCGTCTTGCTTCTTTCCAATGACGTCAACAGTGTGTGGGATGGTACCTGTGGCAA 
Rl GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 
GT GAT C T AAGAAGC CAGAT T CAT C T GC AC AAAT AC G T GG T GG T CT AC T T T AGAGAGAT T GAT AC A 
AAAGACGAT T AC AATGC T CT CAGT GT C T GC C CCAAG T AC CAC C T CAT GAAGGAT GCCAC T GC TT T 
CTGTGCAGAACT TCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTGTAG 
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FIGURE is8 



Sis 

y in 



111 



MSLVLLSLAALCRSAVPREPTVQCGSETGPSPEWMLQHDLIPGDLRDLRVEPVTTSVATGDYSILMNVSWV 
LRADASIRLLKATKICVTGKSNFQSYSCVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP 
NAI^NEDGPSMSWFTSPGCLDHIMKYKKKCVKAGSL^ 

ST I IGFSQVFEPHQKKQTRASVVI PVTGDSEGAT VQLT PY FPTCGSDC IRHKGT WLC PQTGVP FPLDNNK 
SKPGGWLPLLLLSLLVATWVLVAGIYLMWRHERIKKTSFSTTTLLPPIECVLWYPSEICFHHTICY FTEFL 
QNHCRSEVILEKWQKKKIAEMGPVQWIATQKKAADPCWFLLSNDWSVCDGTCGKSEGSPSENSQDLFPLA 
FNLFCSDLRSQIHLHKYVWYFREIDTKDDYNALSVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 
GCCSL 

Important features of the protein: 
Signal peptide: 
amino acids 1-14 

Transmembrane domain: 

amino acids 290-309 

N-glycosylation sites. 

amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 
- 287 



||| 

fj cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

fll amino acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 

N-myristoylation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-452 



APP ID=1 0063735 



Page 312 of 322 



FIGURE 15Q 



AGC CACCAGCGCAAC ATGA CAGT GAAGACCC TGCATGGCCCAGCCAT GGTCAAGTAC T T GC TGCT 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAAT T ACAC T GTCAC T TGGGACCC CAACCGGTACCCC TCGGAAG T TGTACAGGCC CAGT G TAGGA 
ACTTGGGCTGCATCAATGCTCAAGGAAAGGAAGACATCTCCATGAATTCCGTTCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAG TAAG AGGTGCATATCC 
AC T C AGC T GAAGAAG . 



if! 
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FIGURE 160 

MTVKTLHGPAMVKYLLLSILGLAFLSEAAARKIPKVGHTFFQKPESCPPVPGGSMKLDIGIINEN 
QRVSMSRNIESRSTSPWNYTVTWDPNRYPSEWQAQCRNLGCINAQGKEDISMNSVPIQQETLW 
RRKHQGCS VS FQLEPCVL VT VGC TCVT PVI HHVQ 

Signal sequence: 

amino acids 1-30 

N-glycosylation site. 

amino acids 83-87 

N-rayristoylation sites. 

amino acids 106-111, 136-141 
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FIGURE 161 



ACAC T GGC CAAACAAAAACGAAAGCACT CCGTGC T GGAAG TAGGAGGAGAG TCAGGACTCCCAGG 
ACAGAGAGTGCACAAACTACCCAGCACAGCCCCCTCCGCCCCCTCTGGAGGCTGAAGAGGGATTC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCAGCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCC T GAAGAT GAGGAAAAG TT TG GAGGAGCAGC TGAC T CAGGGGT GGAGGAGCC TAGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
H. GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
Q ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
f| . GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 
5' CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
fl CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
W CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
Sj GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
hi TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
if« AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
¥! CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
? GTGGCTGTACTTCACTACCCAGCAAAGCCTCCACGAGGGCAGCTCGCCTTGGAGAGTACTTACTA 
0 CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
|f! CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
J* CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 
|% CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
W CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 
p CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
f|f CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCC'CTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
CGGGACTTAAATAAAGGCAGACGCTGTTTTTCTAAAAAAA 



APPJD-l 0063735 



Page 315 of 322 



FIGURE 162 



MPVPWFLLSLALGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
LRCQKET DCDLCLRVAVHLAVHGHWE E PE DEE KFGGAADSGVE E PRNAS LQAQVVLS FQAY PTARC VLLEV 
QVPAALVQFGQSVGSVVYDCFEAALGSEVRIWSYTQPRYEKELNHTQQLPALPWLNVSADGDNVHLVLNVS 
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDKVLEFPLLKGHPNLCVQ 
VNSSEKXQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWACPMDKYIHKRWALVWLACLLFAAALSLILLLKKDHAKGWLRLLKQDVRSGAAARG 
RAALLLYSADDSGFERLVGALASALCQLPLRVAVDLWSRRELSAQGPVAWFHAQRRQTLQEGGYWLLFSP 
GAVALC SE WLQDGVSGPGAHGPHDAFEASL SC VL PDFLQGRAPGS Y VGAC FD RLLH PDAVPAL FRT VP VFT 
LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 

Signal sequence: 

amino acids 1-20 

Transmembrane domain. 

amino acids 453-475 

N-glycosylation sites. 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394 

Glycosaminoglycan attachment site. 

amino acids 583-586 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 

N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE i6s 



GGGAGGGCTCTGTGCCAGCCCCGATGAGGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 
GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 
AAAC AT CC T GAC G TGGGACAGCGGGCCAGAGGGCACCCCAGACACGGT C TACAGC ATCGAGTATA 
AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 
AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 
GTCAGTGCGGGAGGCCGGTCAGCCACCAAGATGACTGACAGGTTCAGCTCTCTGCAGCACACTAC 
CC T CAAGC C ACC T GAT GTGAC CT G TAT CTCC AAAGT GAGAT CGAT T CAGATGAT T G T TC AT CC TA 
CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 
T T C T AC CAC T TAGAGCT CCAGGT CAACCGCACC TACCAAAT GCACC TT GGAGGGAAGCAGAGAGA 
ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 
CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC 
■^J TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 
Q C T ACAGAT AT G T CAC CAAGC C GCC TGCACC T CCCAACTCCCTGAACGT CCAG CGAG TCC TGAC T T 

jr| TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 
m AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 
TCCACAGCGGCATAGCCTGTCCGAGATCACCTACTTAGGGCAGCCAGACATCTCCATCCTCCAGC 
CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 
SI GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 
h| GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 
CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 
CAC CT T AGGCC TAAAGGTCAGCTT CAGAAAGAGCCACCAGC TGGAAGCT GCAT GT T AGGT GGCCT 
TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 
CCC TGGGGAT T TGCACAGACAGAACAT C TGACCCAAATG TGC TACACAG TGG GGAGGAAGGGACA 
CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 
CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 
TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 
W GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 
(J AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 
f|| ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 
AGAGGGAGT GGCATGCAGGGCCCCTGCCAT GGGTGCGC TCC TCACCGGAACAAAGCAGCATGATA 
AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 
CAAGGCAGAAATGACAGTGCAAGGAGGAAATGCAGGGAAACTCCCGAGGTCCAGAGCCCCACCTC 
CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 
AC AAT CT AGC T CGACAGAGC AT GAGGCC CC T GCC T C T TC T GTCAT T GT T CAAAGG T GGGAAGAGA 
GCCTGGAAAAGAACCAGGCCTGGAAAAGAACCAGAAGGAGGCTGGGCAGAACCAGAACAACCTGC 
ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 
TTCCCAGCCAGGGCAACTGCCTGACGTTGCACGATTTCAGCTTCATTCCTCTGATAGAACAAAGC 
GAAAT GCAGGTCCACCAGGGAGGGAGACACACAAGCCT TT TCTGCAGGCAGGAGT T TCAGACCC T 
ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 
TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 
CTCACCACTCACAAGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 
AT C TG T AAT GTGGGGAT CATAACACCTACC T CATGGAGT TGTGGT GAAGATGAAATGAAGTCAT G 
TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 
AAAAAAAA 
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FIGURE 16a 

MRTLLTILTVGSIJU\HAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSISYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRSIQMIVHPTPTPIRAGDGHRLTLEDIFHDLFYHLELQVNRTYQMHLGGKQREYEFFGLTPDT 
EFLGTIMICVPTWAKESAPYMCRVKTLPDRTWTYSFSGAFLFSMGFLVAVLCYLSYRYVTKPPAP 
PNSLNVQRVLTFQPLRFIQEHVLIPVFDLSGPSSLAQPVQYSQIRVSGPREPAGAPQRHSLSEIT 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

Signal sequence. 

amino acids 1-17 

Transmembrane domain. 

amino acids 233-250 

N-glycosylation sites, 

amino acids 80-83, 87-90, 172-175 

N-myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 
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FIGURE i6s 



TGGCCTACTGGAAAAAAAAAAAAAAAAAAAAAAAGTCACCCGGGCCCGCGGTGGCCACAACATGG 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GT TAAT GTACCG TGGGAAAGC T C T T GAAG AC T TCACGGGCC CT GAT TGTCGTTTTGT GAAT T T TA 
AAAAAGGTGACGATGTATATGTCTACTACAAACTGGCAGGGGGATCCCTTGAACTTTGGGCTGGA 
AGT GT T GAACACAGT T T TGGAT AT T T T CCAAAAGAT T T GAT CAAGGT AC T T CATAAATACACGGA 
AGAAG AGCT ACAT AT TC CAGCAG AT GAGACAGAC TTTGTCTGCTTT GAAGGAGGAAGAGAT GAT T 
T TAAT AGT TATAATGTAGAAGAGC T T T TAGGATC T T TGGAACT GGAGGACT C TGTACC T GAAGAG 
T CGAAGAAAGCT GAAGAAGT TT C TCAGCACAGAGAGAAATC TCC T GAGGAGTC TCGGGGGCGT GA 
ACT TGACCCT GTGCCTGAGCCCGAGGCAT TCAGAGCTGAT TCAGAGGAT GGAGAAGGTGC TT TC T 
C AGAGAGCACCGAGGGGC T GCAGGGACAGCCC TC AGC T CAGGAGAGCCACCCT CACACCAGC GG T 
CCTGCGGCT7VACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 
ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 
AGACAGAT GCT T ACAAAGT CCTGAAAACAGAAAT GAG TCAGAGAGGAAGTGGACAGT GC GT TAT T 
CAT T ACAGCAAAGGAT T TCGTTGGCATCAAAATC TAAGTTTGTT T T ACAAAGAT TGT T T T TAGTA 
CTAAGCTGCCTTGGCAGTTTGCATTTTTGAGCCAAACAAAAATATATTATTTTCCCTTCTAAGTA 
AAAAAAAAAAAAAAAAAAAA 
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FIGURE 166 

MAAAPGLLFWLFVLGALWWPGQSDLSHGRRFSDLKVCGDEECSMmYRGKALEDFTGPDCREW 
FKKGDDVYVYYKLAGGSLELWAGS VEHS FGY FPKDL I KVLHKYTEEELH I PADETD FVC FEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 
EKTDAYKVLKTEMSQRGSGQCVIHYSKGFRWHQNLSLFYKDCF 



~3? 



Important features of the protein: 
Signal peptide: 

amino acids 1-22 



N-glycosylation site. 

%J amino acids 294-298 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 30-34 



Q Tyrosine kinase phosphorylation site. 

PI amino acids 67-7 6 

iH N-myristoylation sites. 

amino acids 205-211, 225-231, 277-283 



Amidation site. 

amino acids 28-32 
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FIGURE 167 



CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCCTTGAGAATGAAGCAGAACAGAAGTTAATAGAGAGCATGTTGCAAAACCTGACAAAACCCGGG 
ACAGGGATTTCTGATGGTGATTTCTGGATAGGGCTTTGGAGGAATGGAGATGGGCAAACATCTGG 
TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAACTGGTACACAG 
ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATGACCAACCAACTGCCAATCCTGGC 
CTTGGGGGTCCCTACCTTTACCAGTGGAATGATGACAGGTGTAACATGAAGCACAATTATATTTG 
CAAGTATGAACCAGAGATTAATCCAACAGCCCCTGTAGAAAAGCCTTATCTTACAAATCAACCAG 
GAGACACCCATCAGAATGTGGTTGTTACTGAAGCAGGTATAATTCCCAATCTAATTTATGTTGTT 
ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
GCAT AAAAGTAAAGGAAGAACAAAAACT AG TC CAAACCAGT C T ACAC TGTGGAT T TCAAAGAGTA 
CCAGAAAAGAAAGTGGCATGGAAGTATAATAACTCATTGACTTGGTTCCAGAATTTTGTAATTCT 
GGATCTGTATAAGGAATGGCATCAGAACAATAGCTTGGAATGGCTTGAAATCACAAAGGATCTGC 
AAGATGAACTGTAAGCTCCCCCTTGAGGCAAATATTAAAGTAATTTTTATATGTCTATTATTTCA 
TTTAAAGAATATGCTGTGCTAATAATGGAGTGAGACATGCTTATTTTGCTAAAGGATGCACCCAA 

AC T T CAAAC T T CAAGCAAAT GAAATGGACAAT GCAGATAAAGT TGT TAT CAACACGTCGGGAG TA 
TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
TGTATATTGTATTGAAATTTACAGTGTGCAAAAGTATTTTACCTTTGCATAAGTGTTTGATAAAA 
ATGAACTGTTCTAATATTTATTTTTATGGCATCTCATTTTTCAATACATGCTCTTTTGATTAAAG 

aaacttattactgttgtcaactgaattcacacacacacaaatatagtaccatagaaaaagtttgt 
tttctcgaaataattcatctttcagcttctctgcttttggtcaatgtctaggaaatctcttcaga 
aataagaagctatttcattaagtgtgatataaacctcctcaaacattttacttagaggcaaggat 
tgtctaatttcaattgtgcaagacatgtgccttataattatttttagcttaaaattaaacagatt 
ttgtaataatgtaactttgttaataggtgcataaacactaatgcagtcaatttgaacaaaagaag 
tgacatacacaatataaatcatatgtcttcacacgttgcctatataatgagaagcagctctctga 
gggttctgaaatcaatgtggtccctctcttgcccactaaacaaagatggttgttcgGggtttggg 
attgacactggaggcagatagttgcaaagttagtctaaggtttccctagctgtatttagcctctg 

ACTATATTAGTATACAAAGAGGTCATGTGGTTGAGACCAGGTGAATAGTCACTATCAGTGTGGAG 
ACAAGCACAGCAGACAGACAT T T T AGGAAGGAAAGGAACTACGAAAT CGTG T GAAAATGGGT T GG 
AACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATGGTGGCTCCTTT 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
AAAAAAAAAAAAAAAAAZWW^AA?y\AAAAAAAAA 
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FIGURE 168 



MSRWSLLLGAALLCGHGAFCRRWSGQKVCFADFKHPCYKMAYFHELSSRVSFQEARLACESE 
GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ 
YRNWYTDE PSCGSEKCWMYHQPTANPGLGGPYLYQWNDDRCNMKHNYI CKYEPE I NPTAPVEK 
P YLTNQPGDTHQNWVTEAG 1 1 PNL I YWI PTIPLLLLI LVAFGTCCFQMLHKSKGRTKT S PNQ 
STLWISKS TRKE S GME V 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

k~. 

III Transmembrane domain: 

III amino acids 214-235 

Iff N-glycosylation sites. 

amino acids 86-89, 255-258 

Wl 
m 

Sf cAMP- and cGMP -dependent protein kinase phosphorylation site. 

%y 

Q amino acids 2 66-269 

N-myristoylation sites. 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107/ 109-114, 140- 
145, 212-217 
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